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METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS 
AND METHODS OF SCREENING FOR MODULATORS OF BREAST 

CANCER 

5 . ' 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority to USSN 60/263,965, filed January 24, 2001 ; 
tJSSN 60/265,928, filed February 2, 2001; USSN 09/829,472 filed April 9, 2001; USSN 
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN 
10 60/294,443, filed May 29, 2001, all of which are incorporated herein by reference m their 
entirety. 

FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
15 expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention finther relates to metiiods for 
identifying and using agents and/or targets that inhibit breast cancer. 

20 BACKGROUND OF THE INVENTION 

Breast cancer is one of the most firequently diagnosed cancers and the second 
leading cause of female cancer death in North America and northern Europe, with lung 
cancer being the leading cause. Lifetime incidence of the disease in the United States is one- 
in-eight, with a l-in-29 Ufetime risk of dying fmm breast cancer. Early detection of breast 

25 cancer, using mammography, clinical breast examination, and self breast examination, has 

dramatically improved the treatment of the disease, although sensitivity is still major concern, 
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast 
cancer consists largely of surgical lumpectomy or mastectoiny, radiation therapy^ anti- 
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hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively 
treated, the current therapies can all induce serious side effects which diminish quaUty of life. 
Deciding on a particular course of treatment is typically based on a variety of prognostic 
parameters and markers (Fitzgibbons et al, 2000, Arch. Pathol. Lab. Med, 124:966-978; 
5 Hamilton and Piccart, 2000, Ann. Oncol. 1 1 :647-663), including genetic predispostion 
markers BRCA-1 and BRC A-2 (Robson, 2000, J. Clin. Oncol. 18:1 13sup-l 1 8sup). 

Imaging of breast cancer for diagnosis has been problematic and limited. In 
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an 
important prognostic factor; five year survival rates drop from 80 percent in patients with no 

10 lymph node metastases to 45. to 50 percent in those patients who do have lymph node 

metastases. A recent report showed that micrometastases can be detected from lymph nodes 
using reverse transcriptase-PCR methods based on the presence of xnRNA for 
carcinoembryonic antigen, which has previously been shown to be present in the vast 
majority of breast cancers but not in normal tissues. Liefers et at.. New England J. of Med. 

15 339(4):223 (1998). 

The identification of novel therapeutic targets and diagnostic markers is 
essential for improving the current treatment of breast cancer pa.tients. Recent advances in 
molecular medicine have increased the interest in tumor-specific cell surface antigens that 
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens 

20 suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and 
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell 
antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer 

25 (Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal 

antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgekin's lymphoma 
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol. 
10:548-^551). 

Other potential immvmotherapeutic targets have been identified for breast 
30 cancer. One such target is polymorphic epithehal mucin (MUGl). MUCl is a transmembrane 
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protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in 
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an 
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et 
al., 2000, Clin. Cancer Res. 6:1693-1701; SchoU et al., 2000, J. Inununother. 23:570-580).. 
5 The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the 
tumor marker, CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients 
have tumors that express neither HER2 nor MUC-1; therefore, it is clear that other targets 
need to be identified to manage locaUzed and metastatic disease. Many other genes have been 
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al., 1987, Lancet 
10 1(8547):1398-1402), c-erbB3 (Naidu et al., 1988, Br. J. Cancer 78:1385-1390), FGFR2 
(Penault-Llorca et al., 1991, Int. J. Cancer 61:170-176), PKW (Preiherr et al., 2000, 
Anticancer Res. 20:2255-2264), MTAl (Nawa et al., 2000, J. Cell Biochem. 79:202-212), 
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat. 59:41-48). 
Although monoclonal antibodies to the protein products of some of these overexpressed 
15 genes have been reported (for review, see Green et al., 2000, Cancer Treat. Rev. 26:269-286), 
none are currently approved for breast cancer therapy in the US. 

Disclosures of certain genes and ESTs described as being expressed in bre^t 
cancer are found in international patent appUcations WO-99/33869, WO-97/25426, WO- 
97/02280 and WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs 
20 described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and 
5,€93,522. The utility of such genes is described in each of these publications, and their 
disclosures are incorporated herein in their entirety. 

While industry and academia have identified novel sequences, there has not 
been an equal effort exerted to identify the fimction of these novel sequences. The 
25 elucidation of a role for novel proteins and compounds in disease states for identification of 
therapeutic targets and diagnostic markers is essential for improving the current treatment of 
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic 
intervention in breast and other cancers. Additionally, provided herein are methods tiiat can 
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can 
30 be used to screen candidate bioactive agents for the abiUty to modulate breast cancer. 

3 



BNSfXX:iD: <VVO Uatoi J tf / /A 2_f.> 



wo 02/059377 



PCT/US02/02242 



. SUMMARY OF THE INVENTION 

The present invention therefore provides nucleotide sequences of genes that 
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic 
purposes, and also as targets for screening for therapeutic compounds that modulate breast 
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent 
to the skilled artisan by the following description of the invention. 

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
breast cancer-associated transcript in a cell from a patient, the method comprising contacting 
a biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at 
least 95% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the biological sample is a tissue sample. In another 
eifabodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA, 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent 

label. 

In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat 
breast cancer. In another embodiment, the patient is suspected of having metastatic breast 
cancer. 

In one embodiment, the patient is a human. 

In one embodiment, the breast cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of ampUfymg 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i) 
5 providing a biological sample from a patient undergoing the therapeutic treatment; and (ii) 
determining the level of a breast cancer-associated transcript in the biological sample by 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring 
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer. 
10 In a further embodiment, the patient has a drag resistant form of breast cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earUer in, the therapeutic 
treatment. 

1 5 Additionally, provided herein is a method of evaluating the effect of a 

candidate breast caiicer drag comprising administering the drag to a patient and removing a 
cell sample from the patient. The expression profile of the cell is then determined. This 
method may further comprise comparing the expression profile to an expression profile of a 
healthy individual. In a preferred embodiment, said expression profile includes a gene of 

20 Tables 1-25. 

In one aspect, the present invention provides aii isolated nucleic- acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25. 

In one embodiment, an expression vector or cell comprises the isolated nucleic 

acid. 

25 In one aspect, the present invention provides an isolated polypeptide which is 

encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-25. 

In another aspect, the present invention provides an antibody that specifically 
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1-25. 
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In one embodinaent, the antibody is conjugated to an effector component, e.g., 
a fluorescent label, a radioisotope or a cytotoxic chemical. 

In one embodiment, the antibody is an antibody fragment. In another 
embodiment, the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a breast 
cancer cell in a biological sample from a patient, the method comprising contacting the 
biological sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting 
antibodies specific to breast cancer in a patient, the method comprising contacting a 
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a 
sequence from Tables 1-25. 

In another aspect, the present invention provides a method for identifying a 
compound that modulates a breast cancer-associated polypeptide, the method comprising the 
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional 
effect of the compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic 
effect, or a chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. 

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting 
proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method 
comprising the step of administering to the subject a therapeutically effective amount of a 
compound identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay 
comprising the steps of: (i) administering a test compound to a mammal having breast cancer 

6 
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or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25 in a treated cell or manraial with the level of gene expression of the 
piolynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
5 the level of expression of the polynucleotide is a candidate for the treatment of breast cancer. 

In one embodiment, the control is a mammal with breast cancer or a cell 
sample therefrom that has not been treated \yith the test compound. In another embodiment, 
the control is a normal cell or mammal. 

In one embodiment, the test compovmd is administered in varying amounts or 
10 concentrations. In another embodiment, the test compound is administered for varying time 
periods. In another embodiment, the comparison can occur after addition or removal of the 
dmg candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are 
15 individually compared to their respective levels in a control cell sample or mammaL In a 
preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 
described herein. 

20 In another aspect, the present invention provides a pharmaceutical 

composition for treating a mammal having breast cancer, the composition comprising a 
compound identified by the assay described herein and a physiologically acceptable 
excipient. 

In one aspect, the present invention provides a method of screening drug 
25 candidates by providing a cell expressing a gene that is up- and down-regulated as in a breast 
cancer. In one embodiment, a gene is selected from Tables 1-25. The method further 
includes adding a dmg candidate to the cell and determining the effect of the drug candidate 
on the expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes 
30 comparing the level of expression in the absence of the dmg candidate to the level of 



OISDOCn>: <WO aaOS9377A21l_> 



wo 02/059377 



PCT/US02/02242 



expression in the presence of the drug candidate, wherein the concentration of the drug 
candidate can vary when present, and wherein the comparison can occur aflCT addition or 
removal of the dmg candidate. In a preferred embodiment, the cell expresses at least two 
expression profile genes. The profile genes may show an increase or decrease. 
5 , Also provided is a method of evaluating the effect of a candidate breast cancer 

drug comprising administering the drug to a transgenic animal expressing or over-expressing 
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid 

10 segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with breast cancer 
is provided. The method comprises determining the expression of a gene of Tables 1-25, 

1 5 preferably a gene of Table 25, in a first tissue type of a first individual, and comparing the 
distribution to the expression of the gene firom a second normal tissue type firom the first 
individual or a second unaffected individual. A difference in the expression indicates that the 
first individual has a disorder associated with breast cancer. 

In a fiirther embodiment, the biochip also includes a polynucleotide sequence 

20 of a gene that is not up- and down-regulated in breast cancer. 

In one embodiment a method for screening for a bioactive agent capable of 
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory 
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory 
protein or fragment thereof. In a preferred embodiment, the method comprises combining a 

25 breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an 
antibody which binds to said breast cancer modulatory protein or firagment thereof. The 
method fiuther includes determining the binding of said breast cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 

30 Preferably, the agent inhibits breast cancer. 
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Also provided herein are methods of eliciting an immmie response in an 
individual. In one embodiment a method provided herein comprises administering to an 
individual a composition comprising a breast cancer modulating protein, or a fragment 
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those 
5 of Tables 1-25. 

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In 

10 another embodiment, said composition comprises a nucleic acid comprising a sequence 
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein, 
or a fragment thereof, comprising contacting an agent specific for said protein with said 

15 protein in an amount sufficient to effect neutralization. In another embodiment, the protein is 
encoded by a nucleic acid selected from those of Tables 1-25. 

In another aspect of the invention, a method . of treating an individual for breast 
cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the 

20 method comprises administering to a patient having breast cancer an antibody to a breast 

c^cer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 
be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
25 In accordance with the objects outlined above, the present invention provides 

novel methods for diagnosis and prognosis evaluation for breast cancer (PC), including 
metastatic breast cancer, as well as methods for screening for compositions which modulate 
breast cancer. Also provided are methods for treating breast cancer. 

Tables 1-24B provide unigene cluster identification numbers for the 
30 nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer 

9 
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samples. Tables 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 18, 19, 20, 21, and 22 list those genes that are 
up-regulated in breast cancer cells. Table 14 lists those genes that are highly upregulated in 
breast cancer cells. Table 1, 2, 3, 15, and 23 list genes that are down-regulated in breast 
cancer cells and Table 16, lists genes that are highly down-regulated in breast cancer genes. 
5 The Tables also provide an exemplar accession number that provides a nucleotide sequence 
that is part of the unigerie cluster. 

Definitions 

The term "breast cancer proteia" or '^breast cancer polynucleotide" or "breast 

10 cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 
sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 

15 509, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 

Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immimogen 
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize 
under stringent hybridization conditions to a nucleic acid sequence, or the complement 

20 thereof of Tables 1-25 and conservatively modified variants thereof or (4) have an amino acid 
sequence that has greater than about 60% amino acid sequenceidentity, 65%, 70%, 75%, 
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater 
amino sequence identity, preferably over a region of over a region of at least about 25, 50, 
100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded by a nucleotide 

25 sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide 
sequence is typically firom a mammal including, but not limited to, primate, e.g., human; 
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer 
polypeptide" and a "breast cancer polynucleotide," iaclude both naturally occurring or 
recombinant forms. 

10 
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A "foil length" breast cancCT protein or nucleic acid refers to a breast cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type breast cancer 
polynucleotide or polypeptide sequences. The "foil length" may be prior to, or after, various 
5 stages of post-translation processing or splicing, including alternative sphcing. 

"Biological sample" as used hereiQ is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, poljoiucleotide or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
himians, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
10 tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also 
include explants and primary and/or transformed cell cultures derived from patient tissues. A 
biological sample is typically obtained from a eukaryotic organism, most preferably a 
mammal such as a.primate e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea 
15 pig, rat, mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, tliis will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g*., 
isolated by another person, at another time, and/or for another purpose), or by perfomiing the 
20 methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly usefol. 

The temis "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
25 same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
30 web site http://www.ncbi.nbn.nih.gov/BLAST/ or the like). Such sequences are then said to 

11 
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be "substantially identical." This definition also refers to, or may be applied to, the 
compliment of a test sequence. The definition also includes sequences that have deletions 
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g., 
polymorphic or allelic variants, and man-made variants. As described below, the preferred 
5 algorithms can account for gaps and the Uke. Preferably, identity exists over a region that is 
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is 
50-100 amino acids, or nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 

10 and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 
cornparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

15 A "comparison window", as used herein, includes reference to a segment of 

one of the number of contiguous positions selected firom the group consisting typically of 
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which 
a sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aUgned. Methods of alignment of sequences 

20 for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Watermm, Adv. AppL 
Math, 2:482 (1981), by the homology alignment algorithm of Needleman & Wxmsch, J, Mol 
Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat 'I 
Acad, Set USA 85:2444 (1988), by computerized implementations of these algorithms 

25 (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 
visual inspection {see, e.g., Current Protocols in Molecular Biology (Ausubel et aL, eds. 
1995 supplement)). 

Preferred examples of algorithms that are suitable for determining percent 

30 sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms. 
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which are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et 
aL, J. Mol Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters 
described herein, to determine percent sequence identity for the nucleic acids and proteins of 
the invention. Software for performing BLAST analyses is pubUcly available through the 

5 National Center for Biotechnology Lifonnation (http://www.ncbi.nhn.nih.gov/). This 

algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et aL, supra). These initial 

10 neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 

15 sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 
hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 

20 sensitivity and speed of the aUgnment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
{see HenikofiF& Henikoff, Proc. Natl Acad, Set USA 89:10915 (1989))' alignments (B) of 

25 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat 7. Acad. ScL USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
Sinn probability (P(N)), which provides an indication of the probabiUty by which a match 

30 between two nucleotide or amino acid sequences would occur by chance. For example, a 
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nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. Log values may 
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc. 
5 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immimologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

10 indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

1 5 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like {see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

20 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance Uquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

25 preparation is substantially purified. In particular, an isolated iiucleic acid is sq)arated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 

30 most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
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removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure. 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
5 in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring anmio acid, as well as to naturally occurring amino acid polymers, those 
containing modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally 
10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compoimds that have 
the same basic chemical stmcture as a naturally occurring amino acid, e.g., an a carbon that is 
botmd to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
15 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that functions similarly to a naturally occurring amino acid. 
20 Amino acids may be referred to herein by either their commonly knovra three 

letter symbols or by the one-letter symbols recommended by the lUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic 
25 acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
30 most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino 
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acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
5 polypeptide also describes silent variations of the nucleic acid. One of skill will recognize 
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can 
be modified to yield a functionally identical molecule. Accordingly, often silent variations of 
a nucleic acid which encodes a polypeptide is implicit in a described sequence with respect to 

10 the expression product, but not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence .which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 

15 results, in tibie substitution of an amino acid with a chemically similar amino acid. 

Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 
polymorphic variants, interspecies homologs, and alleles of the invention.typically 
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), 

20 Glutamic acid (E); 3) Asparagine (N). Glutamine (Q); 4) Arginine (R), Lysine (K); 5) 

Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y),. 
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (^see, 
e,g„ Creighton, Proteins (1984)). 

Macromolecular structures such as polypeptide structures, can be described in 

25 terms of various levels of organization. For a general discussion of this organization, see, 
e.g.. Alberts et al. Molecular Biology of the Cell {3^^ ed., 1994) and Cantor & Schimmel, 
Biophysical Chemistry Part I: TJte Conformation of Biological Macromolecules (1980). 
*Trimary structure" refers to the amino acid sequence of a particixlar peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 

30 structures are conmionly known as domains. Domains are portioiis of a polypeptide that 
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often form a compact unit of the polypeptide and are typically 25 to approximately 500 
amino acids long. Typical domains are made up of sections of lesser organization such as 
stretches of p-sheet and a-helices. "Tertiary structure" refers to the complete three 
dimensional structure of a polypeptide monomer. "Quatemary structure" refers to the three 
5 dimensional structure formed, usually by the noncovalent association of independent tertiary 
units. Anisotropic terms are also known as energy terms. 

**Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical . 
eqxiivalents used herein means at least two nucleotides covalently linked together. 
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more 
10 nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and 

polynucleotides are a polymers of any length, including longer tengths, e.g., 200, 300, 500, 
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will 
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are 
included that may have altemate backbones, comprising, e.g., phosphoramidate, 
15 phosphorothioate, phosphorodithioate, or O-methylphophoroamidite linkages (see Eckstein, 
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and 
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with 
positive backbones; non-ionic backbones, and non-ribose backbones, including those 
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ASC 
20 Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghui & 

Cook, eds.. Nucleic acids containing one or more carbocyclic sugars are also included within 
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done 
for a variety of reasons, e.g. to increase the stabiUty and half-life of such molecules in 
physiological environments or as probes on a biochip. Mixtures of naturally occurring 
25 nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid 
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, for 
example, phosphoramidate (Beaucage et al.. Tetrahedron 49(10):1925 (1993) and references 
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579 
30 (1977); Letsinger et al., NucL Acids Res. 14:3487 (1986); Sawai et al, Chem. Lett. 805 
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(1984), Letsinger et aL, J. Am. Chem. Soc. 1 10:4470 (1988); and Pauwels et aL, Chemica 
Scripta 26:141 91986))^ phosphorothioate (Mag et aL, Nucleic Acids Res. 19:1437 (1991); 
and U.S. Patent No. 5,644,048), phosphorodithioate (Briu et al., J. Am. Chem. Soc. 111:2321 
(1989), O-methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: 
5 A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and 

linkages (see Eghohn, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl. 
31 :1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all 
of which are incorporated by reference). Other analog nucleic acids include those with 
. positive backbones (Denpcy et al., Proc. Natl. Acad, Sci. USA 92:6097 (1995); non-ionic 

10 backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; 
Kiedrowshi et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am. 
Chem. Soc. 1 10:4470 (1988); Letsmger et aL, Nucleoside & Nucleotide 13:1597 (1994); 
Chapters 2 and 3, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
Research", Ed. Y.S. Sanghui and P. Dan Cook; Mesmaeker et aL, Bioorganic & Medicinal 

15 Chem..Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett. 
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate 
Modifications in Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids 
containing one or more carbocyclic sugars are also mcluded within one definition of nucleic 

20 acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs 
are described in Rawls, C & E News June 2, 1997 page 35. All of these references are hereby 
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

25 contrast to the highly charged phosphodiester backbone of naturally occiirring nucleic acids. 
This results in two advantages. Firs^ the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9**C. Similarly, 

30 due to their non-ionic nature, hybridization of the bases attached to these backbones is 
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relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 

5 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

10 xanthine hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a 

naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g. the individual units of a peptide nucleic 

15 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
means. For example, useful labels include ^^P, fluorescent dyes, electron-dense reagents, 
en2ymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

20 or other entities which can be made detectable, e.g,, by incorporating a radiolabel into the 

peptide or used to detect antibodies specifically reactive with the pq)tide. The labels may be 
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position. 
Any method known in the art for conjugating the antibody to the label may be employed, 
including those methods described by Hunter et al.. Nature. 144:945 (1962); David et al., 

25 Biochemistry. 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and Nygren, L 
Histochem. and Cvtochem.. 30:407 f 1982\ 

An "effector" or "effector moiety" or "effector component" is a molecule that 
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalentiy, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

30 The "effector" can be a variety of molecules including, e.g., detection moieties including 
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radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one. that is bound, either 
5 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Altematiyely, method 
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

10 As used herein a "nucleic acid probe or oligonucleotide" is defined as a 

nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation: As used herein, a probe may include natural (i.e.. A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 

1 5 may be joined by a linkage other than a phosphodiester bond, so long as it does not 

functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in 
which the constituent bases are joined by peptide bonds fiather than phosphodiester linkages. 
It will be understood by one of skill in the art that probes may bind target sequences lacking 
complete complementarity with the probe sequence depending upon the stringency of the 

20 hybridization conditions. The probes are preferably directly labeled as with isotopes, 

chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a 
streptavidin complex may later bind. By assaying for the presence or absence of the probe, 
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or 
prognosis may be based at the genomic level, or at the level of RNA or protein expression. 

25 The term "recombinant" when used with reference, e.gi, to a cell, or nucleic 

acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., 
recombinant cells express genes that are not found within the native (non-recombinant) form 

30 of the cell or express native genes that are otherwise abnormally expressed, under expressed 
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or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the rnanipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this maimer, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 

5 form, or an expression vector formed in vitro by ligating DNA molecules that are not 

normally joined, are both considered recombinant for the pxuposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the m vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 

10 recombinantly, although subsequently replicated non-recombinantly, are still considered 

recombinant for the purposes of the invention. Similarly, a '^recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic 

15 acid indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 

20 or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 
acid sequences near the start site of transcription, such as, in the case of a polymerase II type 

25 promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as se« ?:al thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active imder most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 

30 to a functional linkage betwem a nucleic acid expression control sequence (such as a 
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promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
5 synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
- transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

10 duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 

stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture of 

1 5 nucleic, acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic 
Probes^ "Overview of principles of hybridization and the strategy of nucleic acid assays" 

20 (1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 

thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tm is 
the temperature (under defined ionic strength, pH, and nucleic concenkation) at which 50% 
of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at Tm, 50% of the probes are occupied at 

25 equiUbrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 
50 nucleotides) and at least about 60*^C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 

30 formamide. For selective or specific hybridization, a positive signal is at least two times 
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background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42''C, or, 5x SSC, 1% SDS, incubating at dS^'C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PGR, a temperature of about 36°C is typical for low stringency 

5 amplification, although annealing temperatures may vary between about 32'^C and 48°C 

depending on primer length. For high stringency PGR amplification, a temperature of about 
62*'C is typical, although high stringency annealing temperatures can range from about SO^'C 
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95®C for 

10 30 sec - 2 min., an aimealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72^'C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications^ Academic Press, Inc. N, Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 

15 still substantially identical if the polypeptides which they encode are substantially identical. 
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 

20 . 1% SDS at 37°C, and a wash in IX SSC at 45°C, A positive hybridization is at least twice 
background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidehnes for determining hybridization parameters are provided in niunerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et ah 

25 The phrase "fimctional effects" in the context of assays for testing compounds 

that modulate activity of a breast cancer protein includes the determination of a parameter 
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid, 
e.g., a functional, physical, or chemical effect, such ais the ability to decrease breast cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
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inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tumor growth and metastasis in vivo\ mRNA and protein expression in cells undergoing 
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in 
vitrOy in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compoimd that 
increases or decreases a parameter that is indirectly or directly under the influence of a breast 
cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such 
functional effects can be measxired by any means known to those skilled in the art, e.g., 
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refiractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the breast cancer protein; 
measuring binding activity or binding assays, e.g. binding to antibodies or other ligands, and 
measiuing cellular proliferation. Determination of the functional effect of a compound on 
breast cancer can also be performed using breast cancer assays known to those of skill in the 
art such as an m vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cellular proliferation; cellular transformation; 
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel; 
tiunor growth and metastasis in vivo; mRNA and protein expression in cells \mdergoing 
metastasis, and other characteristics of breast cancer cells. The functional effects can be 
evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative 
or qualitative measures of alterations in morphological features, measurement of changes in 
RNA or protein levels for breast cancer-associated sequences, measurenaent of RNA stability, 
identification of downstream or reporter gene expression (CAT, luciferase, p-gal, GFP and 
the like), e.g., via chemiluniinescence, fluorescence, colorimetric reactions, antibody binding, 
inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide 
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 
or compoimds identified using in vitro and in vivo assays of breast cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
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activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of breast cancer proteins, e.g., antagonists. Antisense nucleic acids 
may seem to inhibit expression and subsequent function of the protein. "Activators" are 
compoimds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 

5 up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as 
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules and the Uke. Such assays for inhibitors and activators include, e.g., 
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative 

10 modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of breast cancer can also be identified by incubating breast 
cancer cells with the test compound and determining increases or decreases in the expression 
of 1 or more breast cancer protems, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast 
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables 1- 

15 25. 

Samples or assays comprising breast cancer proteins that are treated with a 
potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(imtreated with inhibitors) are assigned a relaitive protein activity value of 100%. Inhibition 

20 of a polypeptide is achieved when the activity value relative to the control is about 80%, 
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is 
achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1 000-3000% higher. 

25 The phrase "changes in cell growth" refers to any change in cell growth and 

proliferatiors characteristics in vitro or in vivo, such as formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 
limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 

30 ability to form or suppress tumors when injected into suitable animal hosts, and/or 
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immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells a Manual of Basic 
Technique pp. 231-241 (3"" ed. 1994). 

**Txraior cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells or "transformation", in tissue culture, refers 
5 to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of 
new genetic material. Although transformation can arise from infection with a transforming 
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
10 aberrant growth control, nonmorpho logical changes, and^or malignancy (see, Freshney, 
Culture of Animal Cells a Manual of Basic Technique (3*^*^ ed. 1994)). 

"Antibody" refers to a polypeptide comprising a framework region from an 
immimoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobuliii genes include the kappa, lambda, alpha, ganmia, delta, 
15 epsilon, and mu constant region genes, as well as the mjriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its fimctional equivalent will be most critical in specificity and affinity of binding. See Paul, 
20 Fundamental Immunology. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of pol5T)eptide chains, each pair 
having one "hght" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 100 to 110 or more amino acids primarily 
25 responsible for antigen recognition. The terms variable light chain (Vl) and variable heavy 
chain (V h) refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 
characterized fragments produced by digestion with various peptidases. Thus, e.g., pepsin 
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a 
30 dimer of Fab which itself is a Hght chain joined to Vh-Ch1 by a disulfide bond. The F(ab)'2 
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may be reduced under mild conditions to break the disulfide linkage in the hinge region, 
thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is 
essentially Fab with part of the hinge region (see Fwidamental Immunology (Paul ed., 3d ed. 
1993). While various antibody firagments are defined in terms of the digestion of an intact 

5 antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 
herein, also includes antibody fragments either produced by the modification of whole 
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries {see, e.g., McCafferty et aL, Nature 

10 348:552-554(1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 
Nature 256:495-497 (1975); Kozbor et al. Immunology Today 4:72 (1983); Cole et aL, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Cohgan, Current Protocols in 

15 Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Coding, 
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the 
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce 
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such 
as other mammals, may be used to express humanized antibodies. Alternatively, phage 

20 display technology can be used to identify antibodies and heteromeric Fab fragments that 
specifically bind to selected antigens (see, e.g, McCafferty et aL, Nature 348:552-554 
(1990); Marks et aL, Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen bindmg site 

25 (variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e,g,, an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

30 

27 



BNSOCXnO: <WO O20SSS7TA2J_> 



wo 02/059377 



PCT/US02/02242 



Identification of breast cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other 
tissue) may be distinguished fi-om cancerous or metastatic cancerous tissue of the breast, or 
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue 
samples of breast and other tissues fi-om surviving cancer patients. By comparing expression 
profiles of tissue in known different breast cancer states, information regarding which genes 
are important (including both up- and down-regulation of genes) in each of these states is 
obtained; 

The identification of sequences that are differentially expressed in breast 
cancer versus non-breast cancer tissue allows the use of this information in a number of ways. 
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug 
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular 
patient. Similarly, diagnosis and treatment outcomes may be done or confirmed by 
comparing patient samples with the known expression profiles. Metastatic tissue can also be 
analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene 
expression profiles (or individual genes) allow screening of drug candidates with an eye to 
niimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the breast cancer expression profile. This may be done by making biochips 
comprising sets of the important breast cancer genes, which can then be used in these screens. 
These methods can also be done on the protein basis; that is, protein expression levels of the 
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents. 
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy 
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins 
(including antibodies and other modulators thereof) administered as therapeutic drugs. 
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Thus the present inventioii provides nucleic acid and protein sequences that 
are differentially expressed in breast cancer, herein termed '*breast cancer sequences." As 
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a 

5 lower level), hi a preferred embodiment, the breast cancer sequences are from humans; 
however, as will be appreciated by those in the art, breast cancer sequences from other 
organisms may be useful in animal models of disease and dmg evaluation; thus, other breast 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

10 horses, etc.) and pets, e.g., (dogs, cats, etc.). Breast cancer sequences from other organisms 
may be obtained using the techniques outlined below. 

Breast cancer sequences can include both nucleic acid and amino acid 
sequences. As will be appreciated by those in the art and is more fully outlined below, breast 
cancer nucleic acid sequences are useful in a variety of applications, mcluding diagnostic 

15 applications, which will detect naturally occurring nucleic acids, as well as screening 

applications; e.g., biochips comprising nucleic acid probes or PGR microtiter plates with 
selected probes to the breast cancer sequences can be generated. 

A breast cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such 

20 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying breast cancer-associated sequences, the breast cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
25 tissues, or timior tissue samples from patients who have metastatic disease vs. non metastatic 
tissue. Other suitable tissue comparisons include comparing breast cancer samples with 
metastatic cancer samples from other cancers, such as limg, breast, gastrointestinal cancers, 
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, dmg resistant 
states, and tissue undergoing metastasis, are appUed to biochips comprising nucleic acid 
30 probes. The samples are first microdissected, if applicable, and treated as is known in the art 
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for the preparation of mRNA. Suitable biochips are commercially available, e.g. from 
Affymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, the genes showing changes in expression as between 
5 normal and disease states are compared to genes expressed in other normal tissues, preferably 
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney, 
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred 
embodiment, those genes identified during the breast cancer screen that are expressed in any 
significant amoimt in other tissues are removed from the profile, although in some 

10 embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
that the target be disease specific, to minimize possible side effects. 

In a preferred embodiment, breast cancer sequences are those that are up- 
regulated in breast cancer; that is, the expression of these genes is higher in the breast cancer 
tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at 

15 least about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher beiiig preferred. All unigene cluster identification numbers and accession 
niunbers herein are for the GenBank sequence database and the sequences of the accession 
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see, 
e,g,, Benson, DA, et aL, Nucleic Acids Research 26:1-7 (1998) and 

20 http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 

European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
U.S. Patent AppUcation N. 09/687,576, with the same assignee as the present application, 
further discloses related sequences, compositions, and methods of diagnosis and treatment of 
breast cancer is hereby expressly incorporated by reference. 

25 In another preferred embodiment, breast cancer sequences are those that are 

down-regulated in the breast cancer; that is, the expression of these genes is lower in breast 
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1,2, 3, 15, 16 etc.). 
"Down-regulation" as used herein often means at least about a two-fold change, preferably at 
least about a three fold change, with at least about five-fold or higher being preferred. 

30 
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Informatics 

The ability to identify genes that are over or under expressed in breast cancer 
can additionally provide high-resolution, high-sensitivity datasets which can be used in the 

5 areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 
biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another 
example, subcellular toxico logical information can be generated to better direct drug structure 
and activity correlation {see Anderson, Pharmaceutical Proteomics: Targets, Mechanism, 

10 and Function^ paper presented at the IBC Proteomics conference, Coronado, CA (June 11-12, 

1998)). Subcellular toxicological iiiformation can also be utilized in a biological sensor 

*. 

device to predict the Ukely toxicological effect of chemical exposures and likely tolerable 
exposure thresholds {see U.S. Patent No. 5,81 1,231). Similar advantages accrue from 
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, 

15 lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a hbrary format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 

20 database. The electronic database of the invention can be maintained on any electronic 

device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 

25 databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the t , Vative 
and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 

30 with pathological conditions, predisposition to disease, dmg testing, therapeutic monitoring, 
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gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated firom the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 
5 An array of methods for indexing and retrieving biomolecular information is 

known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 
database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 

10 containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 
for obtaining full-length sequences firom the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 

15 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 

using mass spectroscopy firagmentation pattems of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a cldseness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

20 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database stmcture in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

25 the merger of two or more such tree stmctures. 

See also Mount et aL, Bioinfortnatics (2001); Biological Sequence Analysis: 
Probabilistic Models of Proteins and Nucleic Acids (Durbin et aL, eds., 1999); 
Bioinfomtatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis & . 
OeuUette eds., 1998)); Rashidi & Buehler, Bioinfomtatics: Basic Applications in Biological 

30 Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et 
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a/., eds 1997); Bioinfonnatics: Methods and Protocols (MiscneT & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data MiniJig: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995). 

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing 
sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 
target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 
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When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
5 comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e,g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an EBM- 

10 compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 

(e.g., Linux, SimOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 

15 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) coniprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

20 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 

25 database comprising a plurality of assay results obtaiined by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data stmctures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

30 initialized to load and execute the computer program for aUgnment and/or comparison of the 
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assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 
5 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
10 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
15 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the invention, 
20 which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

Characteristics of breast cancer-associated proteins 

25 Breast cancer proteins of the present invention may be classified as secreted 

proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
fimction and replication (including, e.g., signaling pathways); aberrant expression of such 

30 proteins often results in unregulated or disregulated cellular processes (see, e.g.. Molecular 
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Biology of the Cell (Alberts, ed., 3rd ed., 1994). For example, many intracellular proteins 
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease 
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins 
also serve as docking proteins that are involved in organizing complexes of proteins, or 
5 targeting proteins to various subcellular localizations, and are involved in maintaining the 
stmctural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 
in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 

10 conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proUne-rich 
targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

15 few, have.been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

20 One useful database is Pfam (protein famiUes), which is a large collection of multiple 

sequence aligimients and hidden Markov models covering many common protein domains. 
Versions are available via the intemet from Washington University in St. Louis, the Sanger 
Center in England, and the Karolinska Institute in Sweden (see, e.g,, Bateman et al, Nuc, 
Acids Res. 28:263-266 (2000); Sonnhammer et al.. Proteins 28:405-420 (1997); Bateman et 

25 aL, Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al, Nuc. Acids Res. 26:320-322- 
(1998)). 

In another embodiment, the breast cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
30 domains of such proteins may have a number of functions including those already described 
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for intracellular proteins; For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
5 of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 
guanylyl cyclases and receptor serine/threonine protein kinases contain a single 
10 transmembrane domain. However, varioxis other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
15 may be followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and nxmiber of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembraae protein receptors include, but are not limited to the insulin 
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
20 transporters, transferrin receptor, epidermal growth factor receptor, low density Upoprotein 
receptor, epidermal growth factor receptor, leptin receptor, ihterleukin receptors, e.g. DL-l 
receptor, IL-2 receptor. 

The extracellular domains of transmembrane proteins are diverse; however, 
conserved motifs are found repeatedly among various extracellular domains. Conserved 
25 structure and/or functions have been ascribed to different extracellular motifs. Many 

extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 
For example, growth factors such as EOF, FGF and PDGF are circulating growth factors that 
30 bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
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cytokines, mitogenic factors, neurotrophic factors and the hke. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) 
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate 
5 with the extracellular matrix and contribute to the maintenance of the cell structure. 

Breast cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immimotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 

10 situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically pemieablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 
be made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 

1 5 secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the breast cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; by virtue of their circulating nature, 

20 they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology- Breast cancer proteins that are secreted proteins are particularly i 

25 preferred in the present invention as they serve as good targets for diagnostic markers, e,g., 
for blood, plasma, serum, or stool tests. 

Use of breast cancer nucleic acids 

As described above, breast cancer sequence is initially identified by , 
30 substantial nucleic acid and/or amino acid sequence homology or linkage to the breast cancer 
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sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. - ' 

5 The breast cancer nucleic acid sequences of the invention, e.g., the sequences 

in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accorduigly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the breast cancer genes 

10 can be obtained, using techniques well known in the art for cloning either longer sequences or 
the fiill length sequences; see Ausubel, et al, supra. Much can be done by infonnatics and 
many sequences can be clustered to include multiple sequences corresponding to a single 
gene, e.g., systems such as UniGene (see, http://www.ncbi.nhn.nih.gov/UniGene/)- 

Once the breast cancer nucleic acid is identified, it can be cloned and, if 

15 necessary, its constituent parts recombined to form the entire breast cancer nucleic acid 
codmg regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 
segment, the recombinant breast cancer nucleic acid can be fiirther-used as a probe to identify 
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be 

20 used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids 
and proteins. 

The breast cancer nucleic acids of the present invention are used in several 
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
25 or for administration, e.g., for gene therapy, vaccine, and/or antisense appUcations. 

Alternatively, the breast cancer nucleic acids that include coding r : ons of breast cancer 
proteins can be put into expression vectors for the expression of bx\?ast cancer proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids 
30 (both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
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made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the breast cancer nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
hybridization of the target sequence and the probes of the present invention occurs. As 
5 outlmed below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the nxmiber of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 

10 complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 

15 composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

20 In a preferred embodiment, more than one probe per sequence is used, witii 

either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e., have some sequence in common), 
or separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 
25 As will be appreciated by those in the art, nucleic acids can be attached or 

immobilized to a solid support in a wide variety of ways. By "immobiUzed" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable xmder the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
30 covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic mteractions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the soUd support and the probe, are 
5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. LnmobiUzation may also involve a combination of 
covalent and non-covalent interactions. 
10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 
15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystj^ene and 
20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. A preferred substrate is described in copending application entitled Reusable Low 
25 Fluorescent Plastic Biochip, U.S. AppUcation Serial No. 09/270,214, filed March 15, 1999, 
herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 
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sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
5 the biochip is derivatized with a chemical functional group including, but not limited to,' 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g. using linkers as are known in the art; e.g., 
10 homo-or hetero-bifixnctional linkers as are well known {see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
15 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
. 20 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Altematively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
25 in WO 95/25 11 6; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of breast cancer-associated sequences. These assays are typically performed in 
30 conjxmction with reverse transcription. In such assays, a breast cancer-associated nucleic acid 
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sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or 
PCR). In a quantitative amplification, the amount of amplification product will be 
proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of breast cancer-associated RNA. Methods of 

5 quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et al, PCR Protocols, A Guide to Methods and 
Applications {1990). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 

10 dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e,g,, AmpUTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a fimction of 

15 amplification {see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer,com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) {see Wu & Wallace, Genomics 4:560 (1989), Landegren et al. Science 
241:1077 (1988), and Barringer et al. Gene 89:117 (1990)), transcription amplification 
(Kwoh etal, Proc, Natl Acad. Scu USA 86:1173 (1989)), self-sustained sequence replication 

20 (Guatelli al, Proc. Nat Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR, 
etc. 

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e-g., encoding breast 
25 cancer proteins are used to make a variety of expression vectors to express breast cancer 

proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in. the art {see, e.g., 
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are 
used to express proteins. The expression vectors may be either self-replicating 
30 extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
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expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
5 promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 

10 that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Graerally, "operably linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 

15 do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction.sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
nucleic acid will generally be appropriate to the host cell used to express the breast cancer 
protein. Niunerous types of appropriate expression vectors, and suitable regulatory 

20 sequences are known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, 
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 

25 start and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The 
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 



EKSOOCIDl <WQ; 0SaS93T7ASJ_^' 



wo 02/059377 PCT/US02/02242 



In addition, an expression vector may comprise additional elements. For 
example, the expression vector may have two replication systems, thus allowing it to be 
maintained in two organisms, e.g. in mammalian or insect cells for expression and in a 
procaryotic host for cloning and amplification. Furthermore, for integrating expression 
5 vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 
The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constmcts for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra), 
10 In addition, in a preferred embodiment, the expression vector contains a 

selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell xised. 

The breast cancer proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a breast 
15 cancer protein, imder the appropriate conditions to induce or cause expression of the breast 

cancer protein. Conditions appropriate for breast cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled, 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
20 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fimgi, and insect 
25 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli. Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endotheUal 
cells), THPl cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the breast cancer proteins are expressed in 
30 mammaUan cells. Mammalian expression systems are also known in the art, and include 

.45 



wo 02/059377 PCT/US02/02242 



retroviral and adenoviral systems. One expression vector system is a retroviral vector system 
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are 
hereby expressly incorporated by reference. Of particular use as mammalian promoters are 
the promoters from mammalian viral genes, since the viral genes are often highly expressed 

5 and have a broad host range. Examples include the SV40 early promoter, mouse mammary 
tumor viras LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, 
and the CMV promoter {see, e,g., Femandez & Hoeffler, supra). Typically, transcription 
termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter elements, 

10 flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived form SV4G. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 

15 polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 

encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into niiclei. 

In a preferred embodiment, breast cancer proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 

20 bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac 
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a fimctioning promoter sequence, an efficient ribosome 

25 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the breast cancer protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

30 bacterial strains that have been transformed. Suitable selection genes include genes which 
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render the bacteria resistant to drugs such as ampicillin, chloramphenicol, CTythromycin, 
kanamycin, neonaycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 
5 known in the art, and include vectors for Bacillus siibtilis, E, coli. Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Femandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, breast cancer proteins are produced in insect cells. 

10 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, breast cancer protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C maltosa, Hansemda polymorpha, 

15 Khiyveroinyces fragilis and K. lactis, Pichia guillerimondii andP, pastoris, 
Schizosaccharomyces pombe^ and Yarrowia lipolytica. 

The breast cancer protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the 
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an 

20 iimnunogen. Alternatively, the breast cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the breast cancer protein is a 
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression purposes. 

In a preferred embodiment, the breast cancer protein is purified or isolated 

25 after expression. Breast cancer proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase UPLC chromatography, and chromatofocusing. For example, the breast cancer protein 

30 may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration 
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and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982). 
The degree of purification necessary will vary depending on the use of the breast cancer 
protein. In some instances no purification will be necessary. 
5 Once expressed and purified if necessary, the breast cancer proteins and 

nucleic acids are useful in a number of applications. They may be used as immunoselection 
reagents, as vaccine reagents, as screening agents, etc. 

Variants of breast cancer proteins 

10 In one embodiment, the breast cancer proteins are derivative or variant breast 

cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative breast cancer peptide will often contain at least one amino acid substitution, 
deletion or insertion, with amino acid substitutions being particularly preferred. The amino 
acid substitution, insertion or deletion may occur at any residue within the breast cancer 

15 peptide. . 

Also included within one embodiment of breast cancer proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer 

20 protein, using cassette or PGR mutagenesis or other techniques well known in the art, to 

produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant breast cancer protein firagments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 

25 a feature that sets them £^art from naturally occurring allelic or interspecies variation of the 
breast cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as Avill be more fully outlined below. 

While the site or region for introducing an . amino acid sequence variation is 

30 predetermined, the mutation per se need not be predetermined. For example, in order to 
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optimize the perfonnance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed breast cancer variants screened for 
the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, e.g.. Ml 3 primer 

5 mutagenesis and PGR mutagenesis. Screening of the mutants is done using assays of breast 
cancer protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
. insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 

10 some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the breast cancer protein are 

15 desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively, 

20 the variant may be designed such that the biological activity of the breast cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those described above. For example, 
substitutions may be made which more significantly affect: the structure of the polypeptide 

25 backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 
The substitutions which in general are expected to produce the greatest changes in the 
polypeptide's properties are those in which (a) a hydrophiUc residue, e.g. seryl or threonyl is 
substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or 

30 alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue 
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having an electropositive side chain, e-g. lysyl, arginyl, or histidyl, is substituted for (or by) 
an electronegative residue, e.g. glutamyl or aspartyl; or (d) a residue having a bulky side 
chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g. glycine. 

Covalent modifications of breast cancer polypeptides are included within the 

5 scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable 
of reacting with selected side chains or the N-or C-terminal residues of a breast cancer 
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking 
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the 

10 method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more 
fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters 
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters 
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

15 maleimido-l,8-octane and agents such as methyl-3-((p'azidophenyl)dithio)propioimidate. , 
Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton, 

20 Proteins: Sti-ucture and Molecular Properties, pp. 79-86 (1983)), acetylation of the N- 
terminal amine, and amidation of any C-tenninal carboxyl group. 

Another type of covalent modification of the breast cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattem 
of the polypeptide. "Altering the native glycosylation pattem" is intended for purposes 

25 herein to mean deleting one or more carbohydrate moieties found in native sequence breast 
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the 
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many 
ways. For example the use of different cell types to express breast cancer-associated 
sequences can result in different glycosylation patterns. 
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Addition of glycosylation sites to breast cancer polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer 
5 amino acid sequence may optionally be altered through changes at the DNA level, 

particularly by mutating the DNA encoding the breast cancer poljqjeptide at preselected bases 
such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the 
breast cancer polypeptide is by chemical or enzjntnatic coupling of glycosides to the 

10 polypeptide, Such methods are described in the art, e.g., in WO 87/05330, and in Aplin & 
Wriston, CRC Crit. Rev, Biochent., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the breast cancer polypeptide 
may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 

15 deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et aL, Arch. Biochenu Biophys., 259:52 (1987) and by Edge et al^Anal Biochem,, 1 18:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al, Meth. 
EnzymoL, 138:350 (1987). 

20 Another type of covalent modification of breast cancer comprises linking the 

breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyaUcylenes, in the manner set forth in 
U:S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Breast cancer polypeptides of the present invention may also be modified in a 

25 way to form chimeric molecules comprising a breast cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 
molecule comprises a fiision of a breast cancer polj/peptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-^terminus of the breast cancer polypeptide. The 

30 presence of such epitope-tagged fomis of a breast cancer polypeptide can be detected using 

51 



BNSOCXIO: <WO a2059377A2_r_> 



wo 02/059377 PCT/US02/02242 



an antibody against the tag polypeptide: Also, provision of the epitope tag enables the breast 
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an 
5 immunoglobulin or a particular region of an immxmoglobulin. For a bivalent fonn of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the 
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et 

. 10 al, Mol Cell BioL 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto (Evan etal, Molecular and Cellular Biology 5:3610-3616 (1985)); 
and the.Herpes Simplex virus glycoprotein D (gD) tag and its antibody {Paborsky et ai. 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide 
(Hopp et al, BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al, 

15 Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al, J. BioL Chem. 

266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al, 
Proc NatL Acad. Set USA 87:6393-6397 (1990)). 

Also included are other breast cancer proteins of the breast cancer family, and 
breast cancer proteins firom other organisms, which are cloned and expressed as outlined 

20 below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 
used to find other related breast cancer proteins fi'om humans or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the imique areas of the breast cancer nucleic acid sequence. As is generally known in 
the art, preferred PCR primers are fi-om about 15 to about 35 nucleotides in length, with firom 

25 about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for 
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to 
30 generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein 
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should share at least one epitope or determinant with the fiill length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller breast cancer protein will be able to bind to the full-length protein, 

5 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra\ and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 

10 Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immxmizing agent to a protein known to be immunogenic in 
the mammal being immimized. Examples of such immunogenic proteins include but are not 

15 limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, sjmthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

20 The antibodies may, altematively, be monoclonal antibodies. Monoclonal 

antibodies may be prepared using hybridoma methods, such as. those described by Kohler 8c 
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 

25 the immimizing agent. Altematively, the lymphocytes may be immimized in vitro. The 

immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-hxmaan mammalian sources are desired. The Ijrmphocj^es are then 

30 fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 
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glycol, to fonn a hybridoma cell (Coding, Monoclonal Antibodies: Principles and Practice^ 
pp. 59-103 (1986)). Inunortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
5 medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
mediimi"), which substances prevent the growth of HGPRT-deficient cells. 

10 In one embodiment, the antibodies are bispecific antibodies. Bispecific 

antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1-25 or a fi-agment thereof, the other one is for any 

15 other antigen, and preferably for a cell-surface protein or rec^tor or receptor subunit, 

preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to breast cancer protein are capable 
of reducing or eliminating a biological function of a breast cancer protein, as is described 

20 below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or 

preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce 
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or 
the like is preferred, with at least about 50% being particularly preferred and about a 95- 
100% decrease being especially preferred. 

25 In a preferred embodiment the antibodies to the breast cancer proteins are 

humanized antibodies (e.g,, Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs,Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fi:agments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

30 minimal sequence derived firom non-human immxmoglobulin. Humanized antibodies include 
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human immimoglobulins (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 

5 immimoglobulin are replaced by corresponding non-himian residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 

10 and all or substantially all of the framework (FR) regions are those of a hiunan 

immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et aL, Nature 321 :522-525 (1986); Riechmann et al. Nature 
332:323-329 (1988); andPresta, Curr, Op. Struct Biol 2:593-596 (1992)). Humanization 

15 can be essentially performed following the method of Winter and co-workers (Jones et aL, 

Nature 321:522-525 (1986); Riechmann et al. Nature 332:323-327 (1988); Verhoeyen et al. 
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 

20 human variable domain has been substituted by the corresponding sequence from a non- 
. human species. 

Hiiman antibodies can also be produced using various techniques known in the 
art, bicluding phage display libraries (Hoogenboom & Winter, J. Mol BioL 227:381 (1991); 
Marks et al, J, Mol Biol 222:581 (1991)), The techniques of Cole et al and Boemer et al 

25 are also available for the preparation of human monoclonal antibodies (Cole et al^ 

Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boemer et al^J. Immunol 
147(l):86-95 (1991)). Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
iimnunoglobulin genes have been partially or completely inactivated. Upon challenge, 

30 human antibody production is observed, which closely resembles that seen in humans in all 
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respects, including gene rearrangement, assembly, and antibody repertoire. This approach is 
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et aL^ Bio/Technology 10:779- 
783 (1992); Lonberg et aL, Nature 368:856-859 (1994); Morrison, Nature 368:812-13 
5 (1994); Fishwild et al. Nature Biotechnology 14:845-51 (1996); Neuberger, Nature 

Biotechnology 14:826 (1996); Lonberg & Huszar, Intern, Rev. Immunol 13:65-93 (1995). 

By immunotherapy is meant treatment of breast cancer with an antibody raised 
against breast cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 

10 (patient). Active immunization is the induction of antibody and/or T-cell responses in a 

recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 

15 expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the breast cancer proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 
antibodies used for treatment, bind and prevent the secreted protein firom binding to its 

20 receptor, thereby inactivating the secreted breast cancer protein. 

In another preferred embodiment, the breast cancer protein to which 
antibodies are raised is a transmeinbrane protein. Without being bound by theory, antibodies 
used for treatment, bind the extracellular domain of the breast cancer protein and prevent it 
fi-om binding to other proteins, such as circulating ligands or cell-associated molecules. The 

25 antibody may cause down-regulation of the transmembrane breast cancer protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
breast cancer protein. The antibody is also an antagonist of the breast cancer protein. 
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one 

30 aspect, when the antibody prevents Uie binding of other molecules to the breast cancer 
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protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, IL-1, E^JF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
5 activates serum complement when complexed with the transmembrane protein thereby 
mediatmg cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, breast cancer is 
treated by administering to a patient antibodies directed against the transmembrane breast 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 
10 In another preferred embodunent, the antibody is conjugated to an effector 

moiety. The effector moiety can be any number of molecules, including labelling moieties 
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect 
the therapeutic moiety is a small molecule that modulates the activity of the breast cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 
15 associated with or in close proximity to the breast cancer protein. The therapeutic moiety 
may inhibit enzymatic activity such as protease or coUagenase or protein kinase activity 
associated with breast cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells, results in 
20 a reduction in the number of afflicted cells, thereby reducing symptoms associated with 

breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
25 radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the breast cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
30 the therapeutic moiety. 
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In another preferred embodiment, the breast cancer protein against which the 
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which faciUtates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
5 the individual or cell. Moreover, wherein the breast cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody thereto contains a signal for that target localization, i.e., a 
nuclear localization signal. 

The breast cancer antibodies of the invention specifically bind to breast cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
10 Kd of at least about 0.1 mM, more usually at least about 1 \xM, preferably at least about 0.1 
|xM or better, and most preferably, 0.01 p,M or better. Selectivity of binding is also 
important. 

Detection of breast cancer sequence for diagnostic and therapeutic applications 

15 In one aspect, the RNA expression levels of genes are determined for different 

cellular states in the breaist cancer phenotype. Expression levels of genes in normal tissue 
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying 
severities of breast cancer that relate to prognosis, as outUned below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 

20 essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 
generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 

25 obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or cancerous tissue. This will provide for 
molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 

30 patterns within and among cells and tissue. Thus, a differentially expressed gene can 
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qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 

5 detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Altematively, the difference in expression may be quantitative, e.g., in that 
. expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amoimt of transcript, or downregulated, resulting in a decreased amoimt of 
transcript. The degree to which expression differs need only be large enough to quantify via 

10 standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PGR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 

15 about 50%, more preferably at least about 100%, more preferably at least about 150%, more 
preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 

20 final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast 
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be 
' evaluated in a breast cancer diagnostic test. 

25 In a preferred embodiment, gene expression monitoring is performed 

simultaneously on a number of genes. Multiple protein expression monitoring can be 
performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the breast cancer nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of breast cancer sequences in 
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a particular cell. The assays are further described below ia the example. PGR techniques can 
be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the breast cancer protein are 
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of 
5 particular interest are methods wherein an mRNA encoding a breast cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 

10 examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 

15 washing to remove the non-specifically bound probe, the label is detected. For example a 

digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
breast cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

20 In a preferred embodiment, various proteins fi-om the. three classes of proteins 

as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing breast cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding poljq^eptide level. In a preferred embodiment, the 

25 expression profiles are used, preferably in conjunction with high throughput screening 
teclmiques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, breast cancer proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer. 
30 Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis 
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of breast cancer. In one embodiment, antibodies are used to detect breast cancer proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the breast cancer protein is 
5 detected, e.g., by immimoblotting with antibodies raised against the breast cancer protein. 
Methods of immimoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the breast cancer protein find use in 
in situ imaging techniques, e.g., in histology (e.g.. Methods in Cell Biology: Antibodies in 
Cell Biology y volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 

10 to many antibodies to the breast cancer protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the breast cancer protein(s) 
contains a detectable label, e.g. an enzyme marker that can act on a substrate. In another 

15 preferred embodiment each one of multiple primary antibodies contains a distinct and 

detectable label. This method finds particular use in simultaneous screening for a plurality of 
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 

20 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing breast 
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of breast cancer proteins. 

25 Antibodies can be used to detect a breast cancer protein by previously described 
immimoassay techniques including ELIS A, inimunoblotting (western blotting), 
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 
antibodies may indicate an immune response against an endogenous breast cancer protein. 

In a preferred embodiment, in situ hybridization of labeled breast cancer 

30 nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
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breast cancer tissue and/or normal tissue, are made. In situ hybridization {see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 
findings. It is further imderstood that the genes which indicate the diagnosis may differ firom 
5 those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refiractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing breast cancer sequences are used in prognosis 

10 assays. As above, gene expression profiles can be generated that correlate to breast cancer, 
in terms of long term prognosis. Again, this may be done on either a protein or gene level, 
with the? use of genes being preferred. As above, breast cancer probes may be attached to 
biochips for the detection and quantification of breast cancer sequences in a tissue or patient. 
The assays proceed as outlined above for diagnosis. PGR method may provide more 

15 sensitive and accurate quantification. 

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and 
antibodies as described herein are used in drag screening assays. The breast cancer proteins, 
20 antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are 
used in drug screening assays or by evaluating the effect of drug candidates on a "gene 
expression profile" or expression profile of polypeptides. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes after treatment with a candidate 
25 agent (e.g., Zlokamik, et aL, Science 279:84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified breast cancer proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the breast cancer phenotype or an identified physiological 
30 ftinction of a breast cancer protein. As above, this can be done on an individual gene level or 

62 



BNSOOCID: <WO XS2aS93nKZ'J}> 



wo 02/059377 PCT/US02/02242 



by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokamik, supra, 

5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in breast cancer, test 
compoimds can be screened for the ability to modulate gene expression or for binding to the 
breast cancer protein. "Modulation" thus includes both an incirease and a decrease in gene 

10 expression. The preferred amoimt of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing breast cancer, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and m some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast 

15 cancer tissue compared to normal tissue often provides a target value of a lO-fold increase in 
expression to be induced by the test compoimd. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, altematively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the breastcancer protein and standard 

20 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein.. 
25 In this embodiment, the breast cancer nucleic acid probes are attached to 

biochips as outlined herein for the detection and quantification of breast cancer sequences in 
a particular cell. Altematively, PGR may be used. Thus, a series, e.g., of microtiter plate, 
may be used with dispensed primers in desired wells. A PGR reaction can then be performed 
and analyzed for each well. 
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Expression monitoring can be performed to identify compounds that modify 
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide 
sequence set out inTable 17. Generally, in a preferred embodiment, a test modulator is added 
to the cells prior to analysis. Moreover, screens are also provided to identify agents that 
5 modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or 
interfere with the binding of a breast cancer protein and an antibody or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 

10 indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a breast cancer phenotype, e.g. to a normal tissue 
fingerprint. In another embodiment, a modulator induced a breast cancer phenotype. 

15 Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 
concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

Drug candidates encompass numerous chemical classes, though typically they 
20 are organic molecules, preferably small organic compoimds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than 
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise 
functional groups necessary for stmctural interaction with proteins, particularly hydrogen 
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group, 
25 preferably at least two of the functional chemical groups. The candidate agents often 
comprise cyclical carbon or heterocyclic stmctures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. Candidate agents are 
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines, 
pyrimidines, derivatives, stmctural analogs or combinations thereof. Particularly preferred 
30 are peptides. 
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In one aspect, a modulator will neutralize the effect of a breast cancer protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
5 screened for an ability to bind to a breast cancer polj^peptide or to modulate activity. 

Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 

10 are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
compoimds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identi:fy those library members (particular chemical species or subclasses) that 

15 display a desired characteristic activity. The compoimds thus identified can serve as 

conventional "lead compounds" or cian themselves be used as potential or actual therapeutics. 

A combinatorial chemical Ubrary is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 

20 chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al, J. Med, Chem. 37(9): 1233-1251 (1994)). 

25 Preparation and screening of combinatorial chemical libraries is well known to 

those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide Hbraries (see, e.^., U.S. Patent No. 5,010,175, Furka, PepL Prot Res. 37:487-493 
(1991), Houghton et al. Nature, 354:84-88 (1991)), peptoids (PCT PubUcation No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 

30 Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
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hydantoins, benzodiazepines and dipeptides (Hobbs et al, Proa Nat, Acad, Sci. USA 
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara ei al, J, Amer, Chem, Soc, 
1 14:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann etal,,J. Amer, Chem, Soc, 114:9217-9218 (1992)), analogous organic syntheses 
5 of small compound libraries (Cheri et al,, J, Amer, Chem. Soc. 1 16:2661 (1994)), 

oUgocarbamates (Cho, et al.. Science 261:1303 (1993)), and/or peptidyl phosphonates 
(Campbell et al,, J, Org, Chem, 59:658 (1994)). See, generally, Gordon et al,, J. Med, Chem, 
37:1385 (1994), nucleic acid libraries (see, e,g,, Strategene, Corp.), peptide nucleic acid 
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al. Nature 

10 Biotechnology 14(3):309-314 (1996), and PCTAJS96/10287), carbohydrate libraries (see, 

e,g., Liang et al. Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small 
organic niolecule libraries (see, e.g„ benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993); 
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. . 
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, 

15 U.S. Patent No. 5,506,337; benzodiazepmes, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g„ 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Wobum, MA, 433A AppUed Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

20 A number of well known robotic systems have also been developed for 

solution phase chemistries. These systems include automated workstations Uke the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate EE, Zymark Corporation, 
. Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 

25 synthetic operations performed by a chemist. Any of the above devices are suitable for use 
with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e,g., ComGenex, Princeton, N.J,, Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
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MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Extoh, PA, Martek Biosciences, 
Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription, 

5 inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other 
properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 

10 e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for hgand/antibody blading. 

In addition, high throughput screening systems are coihmercially available 

15 (see, e.g,, Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. FuUerton, CA; Precision Systems, Inc., Natick, MA, eta). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 

20 as a high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 

25 proteins or firagments of naturally occurring proteins. Thus, e,g., cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way Ubraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fimgal, viral, and 
mammalian proteins, with the latter being preferred, and hirnian proteins being especially 
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preferred. Particularly useful test compound will be directed to the class of proteins to which 
the target belongs, e.g.y substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
5 to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 

10 may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 
most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fiilly randomized, with no sequence 

15 preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids, 
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of 

20 nucleic acid binding domains, ihe creation of cysteines, for cross-linking, prolines for SH-3 
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to 
purines, etc. 

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
25 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 
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After the candidate agent has been added and the cells allowed to incubate for 
some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
5 with purification and/or amplification such as PGR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a 
fluorescent, a chemiluminegcent, a chemical, or a radioactive signal, to provide a means of 

10 detecting the target sequence's specific binding to a probe; The label also can be an enzyme, 
such as, alkahne phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that can be detected, Altematively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 

15 epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 
streptavidin is labeled as described above, thereby, providing a detectable signal for the 
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 

20 probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

5.594.117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 

5.594.1 18, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby mcorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 

25 conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the, label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
30 altering a step parameter that is a thermodynamic variable, including, but not hmited to, 
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temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
5 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of otlier reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
10 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, eifc, may also be 
used as appropriate, depending on the sample preparation methods and piirity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
15 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the breast cancer phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, eg^., for diagnostic applications, having identified differentially 
20 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
25 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress a breast 
cancer expression pattern leading to a normal expression pattern, or to modulate a single 
breast cancer gene expression profile so as to mimic the expression of the gene fi-om normal 
30 tissue, a screen as described above can be performed to identify genes that are specifically 
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modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or 
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences 
can be identified and used by methods described herein for breast cancer genes or proteins. 
In particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated breast cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of 
breast cancer cells, that have an associated breast cancer expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encodmg a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accompUshed, e.g., PCX US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g., 
induce or suppress the breast cancer phenotype. A change in at least one gene, preferably 
many, of the expression profile indicates that the agent has an effect on breast cancer activity. 
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter 
the phenotype can be devised. With this approach, the drug target need not be known and 
need not be represented in the original expression screening platform, nor does the level of 
transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of 
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either the expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as '"breast cancer proteins" or 
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a 
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 

5 acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a 
preferred embodiment, the breast cancer amino acid sequence which is used to determine 
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another 
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a 
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as 

10 further described herein. 

Preferably, the breast cancer modvdatory protein is a fragment of 
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble 
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred 
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the 

15 C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in 
coupling, i.e., to cysteine. 

In one embodiment the breast cancer proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the breast cancer protein is 
conjugated to BSA. 

20 Measurements of breast cancer polypeptide activity, or of breast cancer or the 

breast cancer phenotype can be performed using a variety of assays. For example, the effects 
of the test compoimds upon the function of the breast cancer polypeptides can be measured 
by examining parameters described above. A suitable physiological change that affects 
activity can be used to assess the influence of a test compoimd on the polypeptides of this 

25 invention. When the functional consequences are determined using intact cells or animals, 
one can also measure a variety of effects such as, in the case of breast cancer associated with 
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional 
changes to both known and imcharacterized genetic markers (e.g., northem blots), changes in 
cell metaboUsm such as cell growth or pH changes, and changes in intracellular second 
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messengers such as cGMP. In the assays of the invention, mammalian breast cancer 
polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in 
vitj'o. For example, a breast cancer polypeptide is fu*st contacted with a potential modulator 
5 and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as. western blotting, 
ELIS A and the like with an antibody that selectively binds to the breast cancer polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PGR, LCR, or 

10 hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the breast cancer 

15 protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or P-gal. The reporter construct is typically transfected into a cell. After 
treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 

20 individual genes and gene products (proteins). That is, having identified a particular 

differentially expressed gene as important in a particular state, screening of modulators of the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as **breast cancer proteins." 
The breast cancer protein may be a fragment, or altematively, be the fiill length protein to a 

25 fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Tj^jically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compoimds are then evaluated for the ability to modulate 
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differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 
isolated gene product is used; that is, the gene products of one or more differentially 
5 expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amoxmt of protein present. 
Alternatively, cells comprising the breast cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a breast 
cancer protein and a candidate compound, and determining the binding of the compound to 

1 0 the breast cancer protein. Preferred embodiments utiUze the human breast cancer protein, 
although other mammalian proteins may also be used, e.g. for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or cierivative 
breast cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the breast cancer 

15 protein or the candidate agent is non-diffiisably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 
made of any composition to which the compositions can be bound, is readily separated fi-om 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 

20 suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 

25 with the reagents and overall methods of the invention, maintains the activity of the 

composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 

30 the protein or agent, excess imbound material is removed by washing. The sample receiving 
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areas may then be blocked through incubation with bovine serum albuinin (BSA), casein or 
other iimocuous protein or other moiety. 

In a preferred embodiment, the breast cancer protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 

5 support and the breast cancer protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 

10 protein binding, functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the 
breast cancer ^protein may be done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of 
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a 

15 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the 
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
oue component can be labeled with different labels, e.g., ^^^I for the proteins and a fluorophor 

20 for the compoimd. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by 
competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand, 

25 etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compoimd. In one 
embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 

30 40^C. Incubation periods are typically optimized,. e.g., to facilitate rapid high throughput 
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screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 
removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
5 compomid. Displacement of the competitor is an indication that the test compound is binding 
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the breast cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Altematively, if the test compound is labeled, the presence of the 

10 label on the support indicates displacement. 

In an altemative embodiment, the test compound is added first, with 
incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the breast cancer protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 

15 support, coupled with a lack of competitor binding, may indicate that the test compoimd is 
capable of binding to the breast cancer-protein. 

In a preferred embodiment, the methodis comprise differential screening to 
identity agents that are capable of modulating the. activity of the breast cancer proteins. In 
this embodiment, the methods comprise combining a breast cancer protein and a competitor 

20 in a first sample. A second sample comprises a test compound, a breast cancer protein, and a 
competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the breast cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the breast cancer protein. 

Altematively, differential screening is used to identify dmg candidates that 
bind to the native breast cancer protein, but caimot bind to modified breast cancer proteins. 
The structure of the breast cancer protein may be modeled, and used in rational dmg design 
to synthesize agents that interact with that site. Dmg candidates that affect the activity of a 
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breast cancer protein are also identified by screening drugs for the ability to either enhance or 
reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 

5 results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
and the amount of bound, generally labeled agent determined. For example, where a 
radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

10 A variety of other reagents may be included in the screening assays. These 

include reagents like salts, neutral proteins, e.g. albumin, detergents, etc. which may be used 
to facilitate optimal protein-protein binding and/or reduce non-specific or backgroimd 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., niay be used. The mixture 

15 of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a breast cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising breast cancer 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 

20 nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 

25 chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate breast cancer agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the breast cancer protein. Once identified, similar structures are evaluated to identify critical 

30 structural feature of the conipoimd. 
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In one embodiment, a method of inhibiting breast cancer cell division is 
provided. The method comprises administration of a breast cancer inhibitor. In another 
embodiment, a method of inhibiting breast cancer is provided. The method comprises 
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells 
5 or individuals with breast cancer are provided. The method comprises administration of a 
breast cancer inhibitor. 

In one embodiment, a breast cancer inhibitor is an antibody as discussed 
above. In another embodiment, the breast cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to 
10 those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow iii stirred suspension culture or suspended in semi-soUd media, 

1 5 such as semi-solid or soft agar. The transformed qells, when transfected with tumor 

suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar grov^^h or colony formation in suspension assays can be used to identify 
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compoimd would reduce or eliminate 

20 the host cells' ability to grow in stirred suspension culture or suspended in semi-sohd media, 
such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
described in Freshney, Culture of Animal Cells a Manual of Basic Technique (3^*^ ed., 1994), 
herein incorporated by reference. See also, the methods section of Garkavtsev et aL (1996), 

25 supra^ herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish imtil 
they touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
30 continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
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higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surroimding cells. Alternatively, labeling index with (^H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
5 supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
nomial phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (^H)-thymidine at saturation density is a 
preferred method of measuring density limitation of growth. Transformed host cells are 
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation 
10 density in non-limititig medium conditions. The percentage of cells labeling with (^H)- 
thymidine is determined autoradiographically, See^ Freshney (1994), supra. 

Growth factor , or serum dependence 

Transformed cells have a lower serum dependence than their normal 
15 counterparts {see, e.g., Temin, J, Natl. Cancer Insti. 37:167-175 (1966); Eagle et al.^ J. Exp. 
Med. 131 :836"879 (1970)); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

20 Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter 'tumor 
. specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released ft^om human glioma at a higher level than from normal brain cells {see, e.g., 
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth. 
25 m Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Timior 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Volkm^ri^ Angiogenesis and Cancer, Sem Cancer Biol (1992)). 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et al , Biol. Chem. 249:4295-4305 (1974); 
30 Strickland & Beers, J. Biol. Chem. 25 1 :5694-5702 (1976); Whur et al., Br. J. Cancer 42:305- 
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312 (1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with 
tumor growth, in Biological Responses in Cancer^ pp. 178-184 (Mihich (ed.) 1985); 
VrcshnQy Anticancer Res. 5:111-130 (1985). 

5 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate breast cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
10 assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in FresTmey (1994), supra^ can be used. Briefly, the 
level of invasion of host cells can be measiired by using filters coated with Matrigel or some 
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
15 the filter, is rated as invasiveness, and rated histologically by number of cells and distance 

moved, or by prelabeling the cells with ^^^I and counting the radioactivity on the distal side of 
the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

20 Effects of breast cancer-associated sequences on cell growth can be tested in 

transgenic or immxme-suppressed mice. Knock-out transgenic mice can be made, in which 
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out 
transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous breast cancer gene site in the mouse genome via homologous recombination. 

25 Such mice can also be made by substituting the endogenous breast cancer gene with a 

mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene, 
e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 

30 is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
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that pQssess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion {see, e,g,, Capecchi et al. Science 244:1288 (1989)). Chimeric targeted mice can be 
derived according to Hogan et aL, Manipulating the Mouse Einbryo: A Laboratoiy Manual, 
5 Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embiyonic Stem Cells: A . 
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987). 

Altematively, various immune-suppressed or immune-deficient host animals 
can be used. For example, genetically athymic "nude" mouse {see, e,g,, Giovanella et al.^ J. 
Natl. Cancer Inst 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 

10 mouse {see, e.g., Bradley et al^ Br, J. Cancer 38:263 (1978); Selby et al^ Br, J, Cancer 
41 :52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10^ cells) 
injected into isogenic hosts will produce invasive tumors in a high proportions of cases, while 
normal cells of similar origin will not. In hosts which developed invasive tuniors, cells 
expressing a breast cancer-associated sequences are injected subcutaneously. After a suitable 

1 5 length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two 
largest dimensions) and compared to the control. Tumors that have statistically significant 
reduction (using, e.g.. Student's T test) are said to have inhibited growth. 

Polynucleotide modulators of breast cancer 

20 . Antisense Polynucleotides 

In certain embodiments, the activity of a breast cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e;g., a breast cancer protein mRNA, or a subsequence thereof 

25 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
30 moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
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sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the breast cancer protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
5 means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 

vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by 

10 binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense 
molecule is for a breast cancer sequences in Tables 1-25, or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 

15 generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, e.g.. Stein & Cohen {Cancer Res. 48:2659 (1988 and 
van der Krol et al {BioTechniques 6:958 (1988)). 

20 Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an 
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
have been described, including group I ribozymes, hammerhead ribozymes, hairpin 

25 ribozymes, RNase P, and axhead ribozymes {see, e.g., Castanotto et ai. Adv. in 

Pharmacology 25: 289-317 (1994) for a general review of the properties of different 
ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et aL, 
Nucl Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent 
30 No. 5,254,678, Methods of preparing are well knoAvn to those of skill in the art {see, e.g., 
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WO 94/26877; Ojwang et al.. Proa Natl Acad. ScL USA 90:6340-6344 (1993); Yamada et 
al. Human Gene Tlierapy^ 1:39-45 (1994); Leavitt a/., Proc. Natl Acad. ScL USA 92:699- 
703 (1995); Leavitt et al. Human Gene Therapy 5:1 151-120 (1994); and Yamada al. 
Virology 205 : 121.126 (1994)). 

-5 Polynucleotide modulators of breast cancer may be introduced into a cell 

containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable Ugand binding molecules include, but are 
not limited to, cell suriface receptors, groAvth factors, other cytokines, or other ligands that 
bind to cell surface receptors. Preferably, conjugation of the Ugand binding molecule does 

10 not substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Altematively, a polynucleotide modulator of breast 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

15 understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating breast cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous 

20 breast cancer protein. Altematively, the methods comprise administering to a cell or 
organism a recombinant nucleic acid encoding a breast cancer protein. This may be 
accomplished in any number of ways. In a preferred embodiment, e.g. when the breast 
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing 
the amount of breast cancer gene product in the cell. This can be accomplished, e.g. , by 

25 ovefexpressing the endogenous breast cancer gene or administering a gene encoding the 
breast cancer sequence, using known gene-therapy techniques, e.g.. In a preferred 
embodiment, the gene therapy techniques include the incorporation of the exogenous gene 
using enhanced homologous recombination (EHR), e.g. as described in PCT/US93/03868, 
hereby incorporated by reference in its entirety. Altematively, e.g. when the breast cancer 
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sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is 
decreased, e.g. by the administration of a breast cancer antisense nucleic acid. 

In one embodiment, the breast cancer proteins of the present invention may be 
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly, 
5 the breast cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify breast cancer 
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 
embodiment, the antibodies are generated to epitopes imique to a breast cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer 
10 antibodies may be coupled to standard affinity chromatography columns and used to pnrify 
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined 
above, since they will specifically bind to the breast cancer protein. 

Methods of identifying variant breast cancer-associated sequences 

1 5 Without being bound by theory, expression of various breast cancer sequences 

is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast 
cancer genes may be determined. In one embodiment, the invention provides methods for 
identifying cells containing variant breast cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous breast cancer genes in a cell. This may be accomplished 

20 using any number of sequencing techniques. In a preferred embodiment, the invention 

provides methods of identifying the breast cancer genotype of an individual, e.g., deteimining 
all or part of the sequence of at least one breast cancer gene of the individual. This is 
generally done in at least one tissue of the individual, and may include the evaluation of a 
number of tissues or different samples of the same tissue. The method may include 

25 comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene, 
i.e., a wild-type gene. 

The sequence of all or part of the breast cancer gene can then be compared to 
the sequence of a known breast cancer gene to determine if any differences exist. This can be 
done using any nmnber of known homology programs, such as Bestfit, etc. In a preferred 

30 embodiment, the presence of a difference in the sequence between the breast cancer gene of 
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the patient and the known hreast cancer gene correlates with a disease state or a propensity 
for a disease state, as outlined herein. 

In a preferred embodiment, the breast cancer genes are used as probes to 
determine the number of copies of the breast cancer gene in the genome, . 
5 In another preferred embodiment, the breast cancer genes are used as probes to 

determine the chromosomal localization of the breast cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the breast 
cancer gene locus. 

10 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 

15 on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel et aL, Pharmaceutical Dosage Forms and Drug Delivery; 
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3,1 992), Dekker, ISBN 0824770846, 
0824769 18X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of 
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is 

20 known in the art, adjustments for breast cancer degradation, systemic versus localized 

delivery, and rate of new protease synthesis, as well as the age, body weight, general health, 
sex, diet, time of administration, drug interaction and the severity of the condition may be 
necessary, and will be ascertainable with routine experimentation by those skilled in the art. 
U.S. Patent Application N. 09/687,576, further discloses the use of compositions and 

25 methods of diagnosis and treatment in breast cancer is hereby expressly incorporated by 
reference. 

A "patient" for the purposes of the present invention includes both himians 
and other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary appUcations. In the preferred embodiment the patient is a mammal, 
30 preferably a primate, and in the most preferred embodiment the patient is himian. 
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The administration of the breast cancer proteins and modulators thereof of the 
present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
5 some instances, e.g., in the treatment of woimds and inflanunation, the breast cancer proteins 
and modulators may be directly applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a breast 
cancer protein in a form suitable for administration to a patient. In the preferred embodiment, 
the pharmaceutical compositions are in a water soluble form, such as being present as 

10 pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts, "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
imdesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 

1 5 propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandeUc acid, 
methanesulfonic acid, ethanesulfonic acid, pl-toluenesulfonic acid, salicylic acid and the like. 
"Pharaiaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesixma, iron, zinc, copper, 

20 manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 

25 tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the 
following: carrier proteins such as serum albimiin; buffers; fillers such as microcrystalline 
cellulose, lactose, com and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 
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The pharmaceutical compositions can be administered in a variety of unit 
dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies, 

5 antisense constructs, ribozymes, small organic molecules, etc) when administered orally, 
should be protected from digestion. This is typically accompUshed either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a . 
protection barrier. Means of protecting agents from digestion are well known in the art. 

10 The compositions for administration will commonly comprise a breast cancer 

protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of vmdesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 

15 contain pharmaceutically acceptable auxiliary substances as required to approximate 

physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 
widely, and will be selected primarily based on fluid volimies, viscosities, body weight and 

20 the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g.. Remington s Pharmaceutical Science (15th ed., 1980) and Goodman & Gillman, 
ITie Pharmacologial Basis of Therapeutics (Hardman et a/.,eds., 1 996)); 

Thus, a typical pharmaceutical composition for intravenous, administration 
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 

25 patient per da;y may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially hijgher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 
in the art, e.g.. Remington 's Pharmaceutical Science and Goodman and Gillman, The 

3 0 Pharmacologial Basis of Titer apeutics^ supra . 
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The compositions containing modulators of breast cancer proteins can be 
administered for therapeutic or prophylactic treatments; In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
5 amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 
the composition should provide a sufficient quantity of the agents of this invention to 

10 effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 

15 treatments.niay be used, e.g., in a mammal who has previously had cancer to prevent a 

recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 

It will be appreciated that the present breast cancer protein-modulating 
compovmds can be administered alone or in combination with additional breast cancer 

20 modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e,g.^ pol3niucleotides 
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
25 methods, reagents, vectors, and cells useful for expression of breast cancer-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedxires for 
30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
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of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell {see, 
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 

5 volimie 152 (Berger), Ausubel et al, eds., Cun-ent Protocols (supplemented through 1999), 
and Sambrook et aL, Molecular Cloning - A Laboratoiy Manual (2nd ed,, Vol. 1-3, 1989. 

In a preferred embodiment, breast cancer proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
breast cancer genes (including both the full-length sequence, partial sequences, or regulatory 

10 sequences of the breast cancer coding regions) can be administered in a gene therapy 

application. These breast cancer genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art.. 

Breast cancer polypeptides and polynucleotides can also be administered as 

15 vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 

compositions can include, e.g., lipidated peptides {see, e.g.,Vitiello, A. et aL,J. Clin. Invest. 
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") 
microspheres {see, e.g, Eldridge, et al, Molec. Immunol. 28:287-294, (1991); Alonso.e^ a/.. 
Vaccine 12:299-306 (1994); Jones et a/.. Vaccine 13:675-681 (1995)), peptide compositions 

20 contained in inamune stimulatiiig complexes (ISCOMS) {see, e,g,, Takahashi et ai. Nature 
344:873^875 (1990); Hu et al, Clin Exp Immunol 1 13:235-243 (1998)), multiple antigen 
peptide systems (MAPs) {see, e.^,, Tam,Prac. Natl Acad, Set U.SA. 85:5409-5413 (1988); 
Tam, J: Immunol Methods 196: 17-32 (1 996)), peptides formulated as multivalent peptides; 
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 

25 vectors (Perkiis, et al. In: Concepts in vaccine development fKiaufinann, ed., p. 379, 1996); 
Chakrabarti, et al. Nature 320:535 (1986); Hu et al. Nature 320:537 (1986); Kieny, et al, 
AIDS Bio/Technology^ 4:790 (1986); Top et al, J. Infect. Dis. 124:148 (1971); Chanda et al. 
Virology 175:535 (1990)), particles of viral or synthetic origin {see, e.g., Kofler et al, J. 
Immunol Methods. 192:25 (1996); Eldridge et al, Sem. Hematol 30:16 (1993); Falo et al, 

30 Nature Med. 7:649 (1995)), adjuvants (Warren et al, Annu. Rev. Immunol 4:369 (1986); 
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Gupta era/.. Vaccine 11:293 (1993)), liposomes (Reddye/ a/., 7. Immunol 148:1585 (1992); 
Rock, Immunol Today 17:131 (1996)), or, naked or particle absorbed cDNA (Ulmer, et a/.. 
Science 259:1745 (1993); Robinson et al. Vaccine 11:957 (1993); Shiver et al. In: Concepts 
in vaccine development (Kaufinann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev. 
5 Immunol 12:923 (1994) and Eldridge et al, Sem, Hematol 30:16 (1993)). Toxin-targeted 
delivery technologies, also known as receptor mediated targeting, such as those of Avant 
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen jfrom rapid catabolism, such as aluminum hydroxide 

10 or mineral oil, and a stimulator of immxme responses, such as lipid A, Bortadella pertussis or 
Mycobacterium tubercidosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminimi hydroxide gel (aliim) or 

15 aluminiun phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cj^okines, such as GM-CSF, interleukin-2, -7, -12, and other Uke growth factors, may also be 
used as adjuvants. 

20 Vaccines can be administered as nucleic acid compositions wherein DNA or 

RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et al. Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 

25 include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g„ U.S. Patent No, 5,922,687). 

For therapeutic or prophylactic imrnvmization purposes, the peptides of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

30 include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
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vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the inmiunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
5 Patent No. 4,722,848. Another vector is BCG (Bacille Cahnette Guerin). BCG vectors are 
described in Stover et aL, Nature 351 :456-460 (1991). A wide variety of other vectors useful 
for therapeutic administration or immunization e.g. adeno and adeno-associated vims vectors, 
retroviral vectors. Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, 
will be apparent to those skilled in the art from the description herein {see, e,g,, Shata et aL, 

10 Mol Med Today 6:66-71 (2000); Shedlock et aL, JLeukoc Biol 68:793-806 (2000); Hipp et 
aL, In Vivo 14:571-85 (2000)). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing a breast cancer gene or portion of a breast cancer gene under the control of a 
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient. 

15 The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins, 
but more preferably encodes portions of the breast cancer proteins including peptides derived 
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For 
example, breast cancer- associated genes or sequence encoding sub fragments of abreast 

20 cancer protein are introduced into expression vectors and tested for their immunogenicity in 
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This 
procedure provides for production of cytotoxic T cell responses against cells which present 
antigen, including intracellular epitopes. 

La a preferred erabodiment, the DNA vaccines include a gene encoding an 

25 adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA 
. vaccine. Additional or alternative adjuvants are available. 

In another preferred embodiment breast cancer genes find use in generating 
animal models of breast cancer. When the breast cancer gene identified is repressed or 

30 diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
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to the breast cancer gene will also diminish or repress expression of the gene. Animal models 
of breast cancer jfind use in screening for modulators of a breast cancer-associated sequence 
or modulators of breast cancer. Similarly, transgenic animal technology including gene 
knockout technology, e.g. as a result of homologous recombination with an appropriate gene 
5 targeting vector, will result in the absence or increased expression of the breast cancer 
protein. When desired, tissue-specific expression or knockout of the breast cancer protein 
may be necessary. 

It is also possible that the breast cancer protein is overexpressed in breast 
cancer. As such, transgenic animals can be generated that overexpress the breast cancer 
10 protein. Depending on the desired expression level, promoters of various strengths can be 
employed to express the transgene. Also, the number of copies of the integrated transgene 
can be determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of breast cancer and are 
additionally useful in screening for modulators to treat breast cancer. 

15 

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic appUcations suggested above, 
kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic 

20 acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 

ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecules 
inhibitors of breast cancer-associated sequences etc, A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 

25. (i.e., protocols) for the practice of the methods of this invention. While the instmctional 
materials typically comprise written or printed materials they are not limited to such. Any 
medium capable of storing such instmctions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
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like. Such media may include addresses to intemet sites that provide such instructional 
materials. 

The present invention also provides for kits for screening for modulators of 
breast cancer-associated sequences. Such kits can be prepared from readily available 

5 materials and reagents. For example, such kits can comprise one or more of the following 
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and 
instructions for testing breast cancer-associated activity. Optionally, the kit contains 
biologically active breast cancer protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 

10 the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 
parameters in disease which may be identified in historical or outcome diata. 

15 EXAMPLES 

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints 

Purifying total RNA from tissue sample using TRIzol Reagent 

20 The sample weight is first estimated. The tissue samples are homogenized in 

1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3 1 00). The size of 
the generator/probe used depends upon the sample amount. A generator that is too large for 
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A 
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g. Fill 

25 tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than 
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization. 

Tissues should be kept frozen until homogenized. The TRIzol is added . 
directly to the frozen tissue before homogenization. Following homogenization, the insoluble 
material is removed from the homogenate by centrifiigation at 7500 x g for 15 min. in a 

30 Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifiige at 4^C. The cleared 
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homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to 

-VO^C for at least one month or else continue with the purification. 

The next process is phase separation. The homogenized samples are incubated 
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1ml of TRIzol reagent is 
5 added to the homogenization mixture. The tubes are securely capped and shaken vigorously 
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temp, for 
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 mm. at 4oC. 

The next process is RNA Precipitation. The aqueous phase is transferred to a 
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then 
10 0.5 ml of isopropyl alcohol is added per 1ml of TRIzol reagent used in the original 

hompgenization. Then, the tubes are securely capped and inverted to mix. The samples are 
then incubated at room temp, for 10 minutes an centriftiged at 6500 rpm in Sorvall for 20 

min. at40C. 

The RNA is then washed. The supernatant is poured off and the pellet washed 
15 with cold 75% ethanoL 1 ml of 75% ethanbl is used per 1 ml of the TRIzol reagent used in 
the initial homogenization. The tubes are capped securely and inverted several times to 
loosen pellet without vortexing , They are next centrifiiged at <8000 rpm (<7500 x g) for 5 
minutes at 4°C. 

The RNA wash is decanted. The pellet is carefully transferred to an 
20 Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it 
in if necessary). Tube(s) sizes for precipitating the RNA depending on the working volumes. 
Larger tubes may take too long to dry. Dry pellet. The RNA is then resuspended in an 
appropriate volume (e.g., 2 -5 ug/ul) of DEPC H2O. The absorbance is then measured. 

The poly A+ mRNA may next be purified from total RNA by other methods 

25 such as Qiagen' s RNeasy kit. The poly A *^ mRNA is purified from total RNA by adding the 
oligotex suspension which has been heated to 37^C and mixing prior to adding to RNA. 
The Elution Buffer is incubated at lO^C. If there is precipitate in the buffer, warm up the 2 x 
Binding Buffer at 650C. The the total RNA is mixed with DEPC-treated water, 2 x Binding 
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Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next 

incubated for 3 minutes at 65^C and 10 minutes at room temperature. 

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably, 
at a "soft setting," The supernatant is removed without disturbing Oligotex pellet. A little bit 
5 of solution can be left behind to reduce the loss of Oligotex. The supematant is saved until 

satisfactory binding and elution of poly A"*" mRNA has been found. 

Then, the preparation is gently resuspended in Wash Buffer OW2 and pipetted 
onto the spin column and centrifiiged at full speed (soft setting if possible) for 1 minute. 

Next, the spin column is transferred to a new collection tube and gently 
10 resuspended in Wash Buffer OW2 and centrifuged as described herein. 

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul 
of preheated (70OC) Elution Buffer. The Oligotex resin is gently resuspended by pipettuig up 
and down. The centrifugation is repeated as above and the elution repeated with fresh elution 
buffer or first eluate to keep the elution volume low. 
15 The absorbance is next read to determine the yield, using diluted Elution 

Buffer as the blank. 

Before proceeding with cDNA synthesis, the mRNA is precipitated before 
proceeding with cDNA synthesis, as components leftover or in the Elution Buffer from the 
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mKNA. 
0.4 vol. of 7,5 M NH40Ac + 2.5 vol. of cold 100% ethanol is added and the preparation 
precipitated at -20''C 1 hour to overnight (or 20-30 min. at -TO^'C), and centrifuged at 
14,000-16,000 X g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80% 
ethanol (-20^C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature. 
The80% ethanol wash is then repeated. The last bit of ethanol from the pellet is then dried 
without use of a speed vacuum and the pellet is then resuspended in DEPC H2O at lug/ul 
concentration. 

Alternatively the RNA mav be purified using other method s (e ^;.. Oiagen's RNeasv kit! 
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No more than 100 ug is added to the KNeasy column. The sample volume is 
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol 
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an 
RNeasy mini spin colunm for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply 
5 the flowthrough to the colunm and centrifuge again. 

Then, transfer colunm to a new 2 ml collection tube and add 500 ul Buffer 
RPE and centrifuge for 15 sec at >1 0,000 rpm. The flowthrough is discarded. 500 ul Buffer 
RPE and is then added and the preparation is centriuged for 1 5 sec at > 10,000 rpm. The 
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at 
10 maximum speed. The column is transferred to a new L5-ml collection tube. 30-50 ul of 

RNase-free water is applied directly onto column membrane. The column is then centrifiiged 
for 1 min at >1 0,000 rpm and the elution step repeated. 

The absorbance is then read to determine yield. If necessary, the material may 
be ethanol precipitated with ammoniiun acetate and 2.5X volvune 100% ethanol. 

15 

First Strand cDNA Synthesis 

The first strand can be make using using Gibco's "Superscript Choice System 
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of poly A+ 
mRNAl. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of 

20 Superscript RT is used. The final volimie of first strand synthesis mix is 20 ul. The RNA 
should be in a volimie no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol 
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4ul 5X ist Strand 
. Buffer, 2 ul of 0. IM DTT, and 1 ul of lOmM dNTP mix. The preparation is then incubated at 
37^C for 2 min before addition of the Superscript RT followed by incubation at 37^C for 1 

25 hour. 

Second Strand Synthesis 

For the second strand synthesis, place 1st strand reactions on ice and add: 91 

ul DEPC H2O; 30 ul 5X 2nd Strand Buffer; 3 ul lOmM dNTP mix; 1 ul 10 U/ul E.coU DNA 

30 Ligase; 4 ul 10 U/ul E.coU DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2 
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hours at 1 6^*0. Add 2 ul T4 DNA Polymerase. Incubate 5 min at le^'C. Add 10 ul of 0.5M 
EDTA. 

Cleaning up cDNA 

5 The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1) 

and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed. 
The cDNA mix is then transferred to PLG tube. An equal volume of 
phenol:chloroform:isamyl alcohol is then added, the preparation shaken vigorously (no 
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is 
10 transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH40Ac and 2-5X 
volume of 100% ethanoL Next, it is centrifuged immediately at room temperature for 20 
min, maximiun speed. The supernatant is removed, and the pellet washed with 2X with cold 
80% ethanol. As much ethanol wash as possible should be removed before air drying the 
pellet; and resuspending it in 3 ul RNase-free water. 

15 

In vitro Transcription dVT) and labeling with biotin 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1.5 ul of cDNA into a thin- wall PGR tube. Make NTP labeling mix by combining 2 ul 
T7 lOxATP (75 mM) (Ambion); 2 ul T7 lOxGTP (75 mM) (Ambion); 1.5 ul T7 lOxCTP (75 

20 mM) (Ambion); 1.5 ul T7 lOxUTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-1 1-UTP 

(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul lOx T7 
transcription buffer (Ambion); and 2 ul lOx T7 enzyme mix (Ambion). The final volurrie is 
20 ul. Incubate 6 hours at 37''C in a PGR machine. The RNA can be furthered cleaned. 
Clean-up follows the previous instructions for RNeasy colimms or Qiagen's RNeasy protocol 

25 handbook. The cRNA often needs to be ethanol precipitated by resuspension in a volimie 
compatible with the fragmentation step. 

Fragmentation is performed as follows. 15 ug of labeled RNA is usually 
fragmented. Try to niinimize the fragmentation reaction volxmie; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 

30 the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
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RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc), The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
5 of the transcript size range 

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycHng through a 5 chip iset), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled KNA (50 ng/ul final cone); 50 pM 948-b control 
10 oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA; 
0.5 mg/ml acetylated BSA; and 300 ul with IxMES hyb buffer. 

The hybridization reaction is conducted with non-biotinylated IVT (purified 
by RNeasy colimins) (see example 1 for steps from tissue to IVT): The following mixture is 
prepared: 

15 IVT antisense RNA; 4 i^g: \i\ 

Random Hexamers (1 jig/jil): 4 jxl 
H2O: ^l 

14|al 

Incubate the above 14 jxl mixture at 70^C for 10 min.; then put on ice. 
20 The Reverse transcription procedure uses the following mixture: 

O.IMDTT: 3 ^l 

SOXdNTPmix: 0.6 [a1 

H2O: 2.4 Kil 

Cy3 or Cy5 dUTP (ImM): 3^1 
25 SS RT n (BRL): 1 ^il 



16^1 

The above solution is added to the hybridization reaction and incubated for 30 min., 42^C- 

Then, 1 yil SSU is added and incubated for another hour before being placed on ice. 
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The SOX dNTP mix contains 25mM of cold dATP, dCTP, and dGTP, lOmM . 
of dTTP and is made by adding 25 p.1 each of lOOmM dATP, dCTP, and dGTP; 10 ^1 of 
lOOmM dTTP to 15 ^il H2O. ] 

KNA degradation is performed as follows. Add 86 ^il H2O, 1 .5 ^,1 IM NaOH/ 
5 2 mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 yil TE/sample spin at 7000 g 
for 10 min, save flow tlirough for purification. For Qiag«i purification, suspend u-con 
recovered material in 500 nl buffer PB and proceed using Qiagen protocol. For DNAse 
digestion, add 1 ul of 1/100 dilution of DNAse/30 ul Rx and incubate at 37°C for 15 min. 
Incubate at 5 min 95°C to denature the DNAse. 

10 

Sample preparation 

For sample preparation, add Cot-1 DNA, 10 SOX dNTPs, 1 ^il; 20X SSC, 2.3 |j;1; Na pyro 
phosphate, 7.5 jil; 10 mg/ml Herring sperm DNA; I ul of I/IO dilution to 21 .8 final vol. Dry 
in speed vac. Resuspend in 15 jil H2O. Add 0.38 ^1 10% SDS. Heat 95°C, 2 min and slow 
15 cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. Washing after 
the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X SSC-H).75ml 10% SDS in 
250ml H2O; IX SSC: 5 min., 12.5 ml 20X SSC in 250ml H2O; 0.2X SSC: 5 min., 2.5 ml 20X 
SSC in 250ml H2O. Dry slides and scan at appropiate PMT's and channels. 
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TABLE 1: Figure 1 from BRCA 001 US 

Table 1 shows genes, (incorporated in their entirety here and throughout the application 
where primekeys are provided), downregulated in tumor tissue compared to normal breast 
tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of nornial breast tissue to tumor 



Pkey ExAccn UnigenelD UnigeneTitle R1 

100472 D90084 Hs.1023 pyruvate dehydrogenase (lipoamide) alpha 5 

100499 751986 H5.283108 hemoglobin, gamma G 10 

100545 M55405 gbiHomo sapiens muctn (MUG-3) mRNA. part 5 

100549 BE142019 Hs.222056 Homo sapiens cDNA FU1 1572 fis. done HE 10 

100613 X52078 Hs.101047 transcription factor 3 (E2A immunoglobul 5 

100635 BE253039 HS.1239S3 Ewing sarcoma breakpoint region 1 5 

100645 X16841 Hs.167988 neural cell adhesion molecule 1 5 

100654 A03758 NM.000477*:Homo sapiens albumin (ALB), m 10 

100702 L27065 gb:Human neurofibromatosis 2 (NF2) mRNA, . 5 

100915 M60832 Hs.249239 collagen, type VIII. alpha 2 5 

100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 10 

101125 AJ250562 Hs.82749 transmen^rane 4 superfamily member 2 5 

101165 M90424 Hs.2099 lipocafin 1 (protein migrating faster th 5 

101184 NM_G01674 Hs.460 activating transcription factor 3 10 

101336 NM_006732 Hs75678 FBJ murine osteosarcoma viral oncogene h 10 

101367 X03350 Hs.4 alcohol dehydrogenase 1B (dass i). beta 10 

101447 M21305 gb:Human alpha satellite and satellite 3 10 

101461 N98569 Hs76422 phospholipase A2. group I lA (platelets, 10" 

101511 M27826 Hs.267319 endogenous retroviral protease 10 

101634 AV650262 HsJ5765 GR02 oncogene 5- 

101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub 10 

102208 U22961 gb:Human mRNA clone with similarity to L 10 

102297 NM_a01504 Hs.198252 G protein-coupled receptor 9 5 

102450 U48251 Hs75871 protein kinase C binding protein 1 10 

102515 U89337 Hs.169886 tenasdnXB 10 

102571 U601 IS Hs.239069 four and a half UM domains 1 5 

102800 AA313538 gb:EST185419Cok}n cardnoma (HOC) cell 10 

102857 NMJ006744 Hs.76461 retinol-binding protein 4. inteistifial 10 

102990 AA829286 Hs.332053 serum amyloid A1 10 

103434 X98085 Hs.54433 tenascin R (restrictin, janusin) 5 

103747 AA081995 gb:zn26d06.n Stratagene neuroeplthellum 10 

103750 AA126129 gb:zm78c07.r1 Stratagene neuroepitheltum 5 

103812 AA137107 Hs.326391 Homo sapiens, clone MGC:1 6638, mRNA. com 10 

103851 AA326216 Hs.8719 hypothetical protein MGC1 136 5 

104080 AB041036 Hs.57771 kallikrein 11 (KLK11:TLSP: PRSS20;hipp 5 

104093 R50727 Hs.336970 ESTs 10 

104108 AA422123 gb:zv26h12.r1 Soares^NhHMPu_S1 Homosapi 5 

104109 AL353957 Hs-284181 hypothetical protein DKFZp434P0531 10 
104250 F06638 Hs.12440 Homo sapiens done 24734 mRNA sequence 10 
104340 AA426189 gbrzwl 1e09.r1 Soares_NhHMPu.S1 Homo sapl 5. 
104492 N73185 Hs.94285 EST 10 
104506 N91071 Hs.109650 ESTs 10 
104511 N99542 Hs.572 orosomucoid 1 5 
104532 AI498763 Hs.203013 hypothetical protein FUJI 2748 10 
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104536 R24024 
104572 Y11312 
104659 AW969769 
104677 AA009764 
5 104711 AA017245 . 
104731 AA019300 
104764 AI039243 
105005 AI298208 
105036 AA130390 
10 105105 R61532 

105231 AW970043 
105239 AA221036 
105921 AA421973 
105957 BE242857 
15 106052 N79885 
106119 AL359624 
106181 A1803651 
106194 AW976171 
106283 A1085846 
20 106379 AL042069 
106451 AW235928 
106491 AA1 35688 
106700 AA905434 
106782 AW054885 
25 106851 AI458523 
108870 AI983730 
105892 AI347578 
106954 AF1 28847 
106991 AJ223811 
30 107103 AI446183 
107124 AB006532 
107148 AI005036 
107214 AF1 27026 
107242 AB02Qj672 
35 107331 AI905985 
107351 U51704 
107423 W26652 
107447 W28516 
107451 AL042425 
40 107453 AI092790 
107459 W38002 
107683 N53167 
107711 W96141 
107754 AA017462 
45 107757 BE621721 
107864 AA025060 
107872 BE271708 
107888 AA025836 
107997 AL049176 
50 108056 AA043675 
108081 AA093668 
108113 AA012881 
108238 AA059473 
108257 AA677927 
55 108335 AA070500 
108351 AA071193 
108382 NIVL006770 
108392 AA075124 
108441 AA079079 
60 108446 AA085383 
108497 AA074897 
108604 AA934589 
108662 API 17646 
108706 AA121620 
65 108738 AA126583 
108827 AI273692 
109123 AI028376 



Hs3776 
H5.25682 

Hs.26530 



Hs.158101 Homo sapiens cDNA FLJ14673 fis. clone NT 
Hs. 132463 phosphoinositide-^-kinase, dass 2, beta 
Hs.105201 ESTs 
Hs.190380 ESTs 
HS32794 ESTs 

HS.12S070 ESTs. Moderately simiiar to 154374 gene 
HS278585 ESTs 
Hs.28805 ESTs 

Hs.25549 hypometical protein FU20898 
Hs.87016 hypothetical protein FLJ22938 
H5.238039 hypothetical protein FU1 1090 

gb:zr03f12.r1 Stratagene NT2 neuronal pr 
Hs.1691 19 ESTs. Weakly similar to T25731 hypotheti 
Hs.27021 hypothetical protein FLJ11159 
Hs.6382 ESTs, Highly similar to To6391 hypotheti 
Hs.11387 KIAA1453 protein 
Hs.191608 ESTs 

Hs.286194 hypothetical protein FU22233 
Hs.25522 KiAA1808 protein 
Hs.1 19021 DKFZP434N061 protein 
Hs.313182 ESTs 

Hs.1 0083 Homo sapiens, done IMAGE:41 39786. mRNA, 
zinc finger protein 216 
Homo sapiens mRNA for K1AA1863 protein, 
gb:tk04g09jc1 NCI_CGAP_Lu24 Homo sapiens 
serum deprivation response (phosphatidyl 
Hs.1 24015 hypothetical protein MGC2605 
Hs.204038 indoleihylanrune N-methyitransferase 
Hs.30127 hypothetical protein 

Hs.9572 ESTs. Highly similar to CYAS.HUMAN ADENY 

Hs,31442 RecQ protein-like 4 

Hs.334305 GS1999full 

Hs.5394 myosin lA 

Hs.175411 KlAA0a65 protein 

Hs.1 11805 ESTs 

Hs.323428 ESTs, Moderately similar to ALU8^HUMAN A 
Hs.61 S3 PTEN mduced putative kinase 1 
Hs.19210 hypothetical protein MGC1 1308 
Hs.283976 hypthetical protein PR02389 
Hs.334703 hypoOietical protein FU14529 

Empirically selected from AFFX single pr 
Hs.47623 ESTs • 
Hs.220687 ESTs 
Hs.269244 ESTs 

Hs.280792 hypothetical protein FU12387 similarto 
Hs.61246 ESTs 

Hs.951 10 ESTs, Weakly similar to A55943 1-phospha 
Hs.191637 ESTs 
Hs.82223 chordin-fike 
ESTs 

musdeblind (Drosophila)-like 
hypothetical protein FU1 1838 
EST 

Hs.144269 ESTs 

gb2m70h03.s1 Stratagene neuroepithelium 
gb:zf79b12.s1 Scares j)ineaLgland„N3HPG 
macrophage receptor wiUi collagenous str 
gb:zm86a01 .si Stratagene ovarian cancer 
gb:zm97c09.s1 Stratagene colon HT29 (937 
gb:2n13g03.s1 Stratagene hNT neuron (937 
gb:zm85a0S.r1 Stratagene ovarian cancer 
ESTs 

Hs.156637 Cas-Br-M (murine) ectroptc retroviral tr 
Hs.74569 K1AA0842 protein 
Hs.158725 ESTs 
Hs.1 10470 ESTs 
Hs,73232 ESTs 

101 



Hs.62633 
Hs.28578 
Hs,72531 
Hs.66783 



Hs.67726 



Hs.4g6g8 



5 

5 

5 

10 

10 

10 

5 

10 
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109546 

109919 

110006 

110141 

110354 

110433 

110448 

110455 

110540 

110553 

110976 

110987 

111158 

111168 

111187 

111307 

111400 

111498 

111651 

111738 

111803 

111995 

112071 

112204 

112258 

112490 

112588 

112654 

112784 

112817 

112885 

112913 

113149 

113174 

113203 

113299 

113367 

113457 

113563 

113574 

113776 

113790 

113807 

113958 

114211 

114254 

114349 

114449 

114484 

114576 

114624 

114844 

114906 

115624 

115666 

115712 

115889 

115949 

116107 

116180 

116267 

116291 

116527 

116659 

116708 

117058 



AA101325 Hs.86154 hypothetical protein FU1 2457 10 

F01449 Hs,26954 Homo sapiens mRNA; cONA DKFZp762G123 (fr 5 

R40604 Hs.129539 ESTs. Weakly similar to MCAT.HUMAN MfTOC 10 

AI094674 Hs.30524 ring finger protein 24 10 

H46749 Hs.31540 ESTs 10 

W22165 . Hs.22586 ESTs 5 

AW294162 Hs.301062 UDP-N-acetyl-alpha-D-ga!actosamlne:po!yp 10 

H51276 Hs.13526 hypothetical protein FU 12688 - 10 

H52576 gb:yt85e08.r1 Soares_pineal_gland_N3HPG 5 

H72639 Hs.167608 ESTs 5 

H60593 Hs.124990 ESTs 10 

AL044174 Hs.159526 patched (Drosophlla) homolog 10 

AI753316 Hs.26034 ESTs 5 

N66616 Hs.138629 H.sapiens mRNA for subtelomeric repeat s 5 

AI798376 gb:tr34t)07.x1 NCLCGAP_Ov23 Homo sapiens 10 

AJ224864 Hs.9688 leukocyte membrane antigen 5 

AA641 636 Hs.37477 ESTs, Weakly similar to T46908 hypotheti 5 

R00144 Hs.189771 ESTs 10 

AI168511 gb:ow90h09.s1 Soares_fetaUiver_spleen_ 10 

R16733 Hs.20499 ESTs 10 

R26065 gb:yh39d03.s1 Scares placenta tslb2HP Homo 5 

AA593731 Hs.325823 ESTs. Moderately similar to ALU5.HUMAN A 10 

R42333 Hs,302292 ESTs 10 

AL1 17490 Hs.47225 Ras-associated pratebi Rapl 10 

NM_006668 Hs.25121 cytochrome P450, subfamily 46 (cholester 10 

R51889 Hs.24990 ESTs 5 

R31094 Hs.24378 ESTs 10 

R77302 gb:yi75h08.s1 Scares placenta Nb2HP Homo 10 

BE618629 Hs.268809 ESTs 5 

T98628 Hs.191290 ESTs 5 

AI057205 Hs.14584 ESTs . 5 

AA581428 Hs.5021 EST 10 

T16837 Hs.4241. ESTs 5 

T51 588 gb:yb27e06.s1 Stratagene fetal spleen (9 10 

T54659 Hs.301755 Homo sapiens cDNAFU11465fis. done HE 5 

AA743563 Hs.10305 ESTs 5 

AW207424 Hs.332594 ESTs 10 

N92359 Hs.14518 ESTs. Moderately similar to A48752 B-cel 10 

R16763 Hs.268679 ESTs 5 

AA913635 Hs.326413 Homo sapiens cDNA FLJ2081 2 fis. done AD 10 

R06874 Hs.268628 ESTs. Moderately similar to ALU1_HUMAN A 5 

A1791905 Hs.95549 hypothetical protein 10 

A1244311 Hs.26912 ESTs 10 

W07586 Hs.8045 ESTs 3 
W86195 gb:2h54e05.s1 Soares_letaUiver_spteen. 10 ' 

Z39319 Hs.27347 EST 10 

AB016263 Hs.180336 tumor neaosis factor receptor superfami 5 

AA745978 Hs.28273 ESTs 5 

AA020736 gb:ze63bl 1 .si Scares retina N2b4HR Homo 5 

AA034378 Hs.267319 endogenous retroviral protease 5 
AA06S096 gb:zm50a02.s1 Stratagene fibroblast (937 5 

AA081 507 gb:zn05b10.r1 Stratagene hNT neuron (937 5 

AA234826 Hs.87386 EST 5 

AA234462 Hs.87350 ESTs 5 

AK000725 Hs.50579 hypothetical protein FLJ20718 3 

AF173081 Hs.178215 Vertebrate UN7 homolog 1, Tax inteiadl 5 

AB020649 Hs.74569 KIAA0842 protein 5 

AA398841 Hs.39850 hypothetical protein FU2051 7 10 

AI478427 Hs.43125 esophageal cancer related gene 4 protein 10 

AL133916 Hs.172572 hypothetical protein FLJ20093 10 

AA463902 Hs.13522 ESTs. Weakly simflar to 138022 hypotheti 5 

AW968703 Hs.30085 hypotheUcal protein FU231 86 5 

AW410377 Hs.41502 hypothetical protein FU21 276 5 

AW1 94253 Hs.68607 ESTs 10 

BE314852 Hs.168694 Homo sapiens done 23763 unknov/n mRNA, p 5 

F10528 Hs.70001 ESTs, Moderately similar to JC61 69 nude 5 
AW801806 gb:IL5-UM0070.110400-062-g07 UM0070 Homo 5 
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117151 A18036S6 Hs.42373 ESTs 5 

117226 N20468 gb:yx39b10.s1 Soares melanocyte 2NbHM Ho 10 

117323 AI472863 Hs.43387 ESTs 5 

117571 N34417 Hs.44584 ESTs. 3 

5 117624 N26627 Hs.82364 ESTs. Weakfy similar to JC4124 pregnancy 5 

117673 N40551 Hs.184043 Homo sapiens Ets-1 btading protein (E1B) 10 

117847 N49285 Hs.182391 ESTs 10. 

117877 AW263476 Hs.44268 myelin gene expression factor 2 10 

117919 BE222341 Hs.279472 ESTs 5 

10 118049 N53145 gb:yv55ft)9^1 Soares fetal liver spleen 3 

118413 AW955696 Hs.90960 ESTs 10 

118613 AI078236 Hs.49688 ESTs 5 

118664 N709D7 Hs.230619 EST 10 

1 18858 AL122040 Hs.102981 Homo sapiens mRNA; cDNA DKFZp434G1972 (f 3 

15 118902 AA993527 Hs.293907 hypothetical protein FLJ23403 5 

119039 AI16G570 Hs.252097 pregnancy specific beta-1-glycoprotein 6 3 

119159 AF142419 HS.15D20 homolog of mouse quaking QKI (KH domain 5 

119216 AA514422 Hs.221849 ESTs 5 

119317 AK002001 Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a 10 

20 119366 T77892 gb:yd20f04^1 Scares fetal liver spleen 5 

119378 T81824 Hs.90949 EST 5 

119528 W38051 Empirically selected from AFFX single pr 10 

119792 AL049798 Hs.80552 demiatopontin 3 

119800 AF086332 Hs:58314 ESTs 10 

25 119817 AF088081 Hs.159690 ESTs 5 

119835 AF086429 Hs.58429 ESTs 5 

119923 AW803308 Hs.62954 femtin, heavy' polypeptide 1 5 

119961 U34249 Hs.337461 Human putative zinc finger protein (ZNFB 5 

120379 AL042725 gb:DKFZp434B1822_n 434 (synonym: htes3) 10 

30 120931 AW136934 Hs.97162 ESTs 5 

121037 AA907743 Hs.142373 ESTs 5 

121282 AA401695 Hs.97334 ESTs 5 

121382 AA405763 Hs.111939 Homo sapiens cDNA FU20470 fis. done KA 5 

121764 AA421452 Hs.164851 ESTs. Weakly simHar to KIAA0926 protein 5 

35 122034 AK000229 Hs.98017 Homo sapiens cDNA FU20222 fis. done CO 10 

122441 AA447555 Hs.99116 EST 10 

122756 AA458945 Hs.95898 ESTs 10 

122771 AW135093 Hs.97282 ESTs, Highly similar to GIOO^HUMAN 110 K 5 

123601 AA609122 Hs. 112645 Homo sapiens mRNA: cDNADKF2p434D2472(f 5 

40 123623 AI024595 Hs.97508 a disintegrin and metalloproteinase doma 5 

123941 AA621529 gb:af47a02.s1 Soares_totalJetus,Nb2HF8^ 10 

1 24215 H62570 gb:yr44a01 .r1 Scares fetal liver spleen 5 

124276 H83465 gb7s91a1 1 .si Scares retina N2b5HR Homo 5 

124680 AK001527 Hs.163953 hypofheUcal protein FU 10665 5 

45 125099 NM_014312 Hs.1 12377 cortic al thymocyte receptor (X. laevis 10 

125121 T98199 Hs.48403 hypothetical protein FU10847 10 

125188 BE299557 Hs.271749 ESTs. Moderately simflar to ALU8.HUMAN A 5 

125284 NM_002666 Hs.103253 perilipin 10 

125906 BE256206 Hs! 17775 p75NTR-associated cell death isxecutor; o 5 

50 128484 AA485421 Hs.270503 ESTs. Weakly similar to ALU7_HUMAN ALU S 10 

126511 NM.002250 Hs.10082 potassium intermediate/small conductance 10 

128538 R44214 Hs.101189 ESTs 5 

128606 C16161 Hs.283040 hypothefical protein PR02543 5 

128850 AA193106 Hs.180817 chromosome 11 open reading frame 23 10 

55 128870 H39537 Hs.75309 eukaryotk; translation elongation factor 10 

128903 AW150717 Hs.296176 STAT induced STAT inhbitor 3 10 

128931 N62869 Hs.107242 Homo sapiens cONAFU 12965 fis. done NT 10 

129001 AA443323 Hs.107812 BPOZ protein 5 

129091 AA056483 Hs^463 Human Chromosome 16 BAG done CiT987SK-A 5 

60 129101 NM.013403 Hs.108665 zinedtn 10 

129146 At117472 - Hs;i08924 SH3-domain protein 5 (ponsin) 5 

129213 Ai146494 Hs.109525 ESTs. Weakly ^milar to IRX2_HUMAN IROQU 3 

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 5 

129265 AA530892 Hs.171695 dual spedfidty phosphatase 1 5 

65 129285 BE617015 Hs.11006 ESTs. Moderately similar to T1 7372 plasm 10 

129346 AF110141 Hs.288908 WAS prot^ family, member 2 10 

129368 NM-003877 Hs. 11 0776 STAT induced STAT inhibitor-2 5 
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129371 


X06828 


Hs.1 10802 


von Wiilebrand factor 


5 


129381 


AW245805 


Hs.110903 


daudin 5 (transmembrane protein deleted 


10 


129440 


W37944 


Hs.4007 


SarcolemmaUassociated protein 


5 


129441 


BE06106g 


Hs.301943 


KIAA0467 protein 


10 


129516 


AF020038 


Hs.11223 


isodtrate dehydrogenase 1 (NADP+), soiu 


10 


129554 


BE222078 


Hs.113069 


ESTs 


ID 


129684 


BE622468 


Hs.11924 


ESTs. Weakly similar to 138022 hypotheti 


5 


129702 


AI304966 


Hs.12035 


ESTs. Weakly similar to 138022 hypotheti 


5 


129778 


AK001676 


Hs.12457 


hypotheticat protein FU10814 - 


10 


129893 


AK000956 


Hs.13209 


hypothetical protein FU10094 


5 


129928 


A1338993 


Hs.1 34535 


ESTs 


5 


129973 


AJ251760 


Hs.273385 


guanine nucleotide binding protein (G pr 


5 


129977 


m 000399 


Hs.1395 


eariy growth response 2 (Krox-20 (Drosop 


5 


130014 


m 001158 


Hs.143102 


amine oxidase, copper containing 2 (reti 


5 


130085 


M62402 


Hs,274313 


insulin-iike growth factor binding prote 


10 


130089 


AA452006 


Hs.333199 


ESTs 


5 


130162 


W80711 


Hs.319946 


Homo sapiens mRNA for KIAA1727 protein, 


5 


130243 


D88435 


Hs.153227 


cydin G associated kinase 


10 


130315 


A1241084 


Hs.154353 


nonselective sodium potassium/proton exc 


5 


130339 


AA435746 




gb:zt79e03^1 SoafesJtestis_NHT Homo sap 


5 


130400 


V00517 


Hs.283108 


henragfobin. gamma G 


10 


130436 


NM_001928 


Hs.155597 


D. component of complement (adipsin) 


10 


130478 


X72308 


Hs.251526 


small inducible cytokine A7 (monocyte ch 


5 


130480 


BE222978 


Hs.15760 


MYG1 protein 


10 


130494 


AW390834 


HS.75B74 


pregnancy-asscdated plasma protein A 


5 


130563 


BE270472 


Hs.279900 


HSPC01 5 protein 


10 


130589 


AL1 10226 


Hs.16441 


DKFZP434H204 protein 


10 


130606 


AI652143 


Hs.288382 


hypothetical protein FLJ13111 


5 


130634 


AI769067 


Hs.1 27824 


ESTs, Weakly similar to T28770 hypotheti 


3 


130683 


AA993269 


Hs.1 7872 


Homo sapiens, done IMAGE:3875012, mRNA 


10 


130689 


NM 006691 


Hs.17917 


extracellular link domain-containing 1 


10 


130716 


AA232075 


Hs.18259 


XPA binding protein 1; putative ATP(GTP) 


5 


130718 


AF263462 


Hs.18376 


KIAA1319 protein 


10 


130722 


N41322 


Hs.18441 


ESTs 


5 


130798 


M81349 


Hs.1955 


serum amyloid A4, constitutive 


10 


130840 


BE048821 


Hs.20144 


small inducible cytokine subfamily A (Cy 


10 


131184 


AB040935 


Hs.23954 


cerebral cell adhesion molecule 


10 


131261 


AA360419 


Hs.171776 


inositol (myo)-1 (or 4)-monophosphatase 1 


10 


131282 


X03350 


Hs,4 


alcohol dehydrogenase 16 (dass 1), beta 


10 


131328 


AW939251 


Hs.25647 


v-fos FBJ murine osteosarcoma viral onco 


10 


131340 


AK000393 


Hs.25817 


BTB (POZ) domain containing 2 


5 


131341 


AF110908 


Hs.297660 


TNF reoeptor-assodated tactor 3 


5 


131406 


H83294 


Hs.284122 


Wnt inhibitory factor-1 


5 


131489 


BE394648 


Hs.27414 


hypothetical protein 


5 


131543 


AW966881 


Hs.41639 


programmed cell death 2 


10 


131692 


BE559681 


Hs,30736 


KIAAbl24 protein 


5 


131753 


AA829286 


Hs.332053 


serum amyloid A1 


10 


131756 


AA443966 


Hs.31595 


ESTs 


10 


131785 


H69342 


Hs.26320 


TRABID protein 


10 


131815 


AA021258 


Hs.32753 


ESTs 


5 


131819 


BE244961 


Hs.173103 


FE65-LIKE 2 


5 


131828 


AJ000263 


Hs.278658 


keratin, hair, basic, 6 (monilethrix) 


10 


131888 


AW294659 


Hs.34054 


Homo sapiens cDNA: FU22488 fisi done H 


5 


131927 


AJ003112 


Hs.34780 


douWecortex; fesencephaly, X-linked (d 


5 


131949 


AK000010 


Hs.258798 


hypothetical protein FU20003 


10 


132115 


H81604 


Hs.178471 


KIAA0798 gene produd 


5 


132177 


X80818 


Hs.1 78078 


glutamate receptor, metabotropic 4 


5 


132296 


AA467752 


Hs.195161 


ESTs 


5 


132426 


AW1 18072 


Hs.8998i 


diacylglyceral kinase, zeta (104kD) 


10 


132477 


S68874 


Hs,170917 


prostaglandin E receptor 3 (subtype EP3) 


5 


132675 


AI291496 


Hs.5476 


Homo sapiens, done IMAGE:3530123, mRNA, 


10 


132796 


NM.006283 


Hs.173159 


transforming. addk;coiled-coil contain 


10 


132898 


W2854d 


Hs.224829 


ESTs 


10 


132905 


NM 004235 


Hs.7934 


KruppeMike factor 4 (gut) 


10 


132953 


BE175645 


Hs.321264 


LBP protein 32 


5 


133116 


BE563968 


Hs.6529 


ESTs. Weakly similar to 178885 sertnefth 


5 


133120 


NMJ003278 


Hs.65424 


tetranectin (plasminogen^inding protein 


10 
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133139 AF052138 Hs.6580 Homo sapiens cDNA: FU23227 fis, clone C 5 

133163 M668224 Hs.6634 Homo sapiens cDNA: FU22547 lis, clone H 5 

133268 AW956781 Hs.293937 ESTs, Weakly simaar to FXD2.HUMANF0RKH 5 

133272 NM 002776 Hs.69423 kaliauein 10 (KLK10)(PRSSL1)(nes1) 5 

S 133379 AA207059 gb.zq80h09.s1 Stratagene hNT neuron (937 5 

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5 

133552 H21497 Hs.7471 BBP-IIke protein 1 5 

133702 LQ2321 Hs.75652 glutathione S4ransferase M5 5 

133719 H26904 Hs.75736 apolipoprotein 0 5 

10 133731 N71725 Hs^2572 hemoglobin, alpha 2 10 

133789 T85626 Hs76239 hypothetical protein FU20608 5 

134007 AF072441 HsJ840 caldneurin binding protein 1 10 

. 134055 D85062 Hs.182423 ESI (zebrafish) protein, human homolog o 10 

134111 AI372588 Hs.8022 TU3Aproteb .10 

15 134117 AA081846 Hs7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 10 

134177 BE243319 Hs79572 KIAA0652 gene product 5 

134308 AW905827 Hs,81454 ketohexokinase (fnjctokinase) 10 

134361 BE549343 Hs.82208 acyt-Coenzyme A dehydrogenase, very long 5 

134369 AF207664 Hs.8230 a disintegrin-^ike and metalloprotease ( 5 

20 134449 L34155 Hs.83450 laminln. alpha 3 (nicein (150kD). kalini 5 

134467 AI190413 Hs.8373 ESTs 10 

134496 (VI64936 gb:Homo sapiens retinoic acid-inducible 10 

134510 NM_002757 Hs.250870 mitogenHactivaled protein kinase kinase 10 

134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10 

25 134577 BE244323 Hs.B5951 expo rtin, IRN A (nuclear export receptor 5 

134591 U73394 Hs.166085 killer cell immunoglobulln-Iike receptor 5 

134678 AL008583 Hs, 182595 dynein, axonemal. light polypeptide 4 5 

134728 D10216 Hs.89394 POU domain, dass 1 , transcription facto 5 

134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10 

30 134786 T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10 

134912 T87521 Hs.261457 ESTs 5 

134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 10 

134969 H22570 Hs.172572 hypothetical protein FU20093 5 

135001 AA302517 Hs.92732 KIAA1444 protein 5 

35 135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10 

135173 AU)36557 Hs.95910 putative lymphocyte G0/G1 switch gene 10 

135197 U76456 Hs.190787 tissue inhibitor of metailoproteinase 4 5 

135219 AB002361 Hs.96633 K1AA0363 protein 5 

135250 U83171 Hs.97203 small indudble cytokine subfamily A (Cy 5 

40 135304 AA416829 Hs.191597 ESTs 5 

135337 AA905406 Hs.9905 ESTs. Weakly similar to unnamed protein 3 

135417 X55019 Hs.99975 cholineigic receptor, nicotinic, delta p 10 

101367 X03350 Hs.4 alcohol dehydrogenase 1B (dass I), beta 5 

128870 H39537 Hs75309 eukaryotic translation elongation fador 5 

45 129381 AW2458b5 Hs.1 10903 claudin 5 (transmembrane protein deleted 5 

130085 M52402 Hs.274313 insulin-like growth factor binding prote 5 

130689 NM_005691 Hs.17917 extracellular link domain-containing 1 10 

133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 3 

133407 AF017987 Hs7306 secreted frizzled-related protein 1 5 

50 , 133731 N71725 Hs.272572 hemogtobin. alpha 2 5 

1 34369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5 

135066 X04430 Hs.93913 Interleukin 6 (interferon, beta 2) « 10 

135173 Ali)35557 Hs.95910 putative lymphocyte G0/G1 switch gene 5 

322580 AK001852 Hs.274151 ligatin 5 

55 408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 • 10 

418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5 

427458 BE208364 Hs.29283 ESTs, Weakly similar to U<HU proteoglycan link 5 

446674 AA563892 Hs,305000 solute earner family 4 (anion exchanger), memb 10 

449826 U85642 Hs.138506 ESTs 5 

60 RC_H15814_s Human apMI mRNA for GS3109 (novel adipose specific collagen 10 

Ya024w/RIP1 EST-YEL024W/RIP1 3 
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Table 1 A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 . 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 
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Pkey: Unique Eos probeset identifier number 

CAT number Gene dusternumber 
Accession: Genbank accession numbers 

Pkey CAT Number Accessions 

108446 112224_1 AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA075372 AA128503 
108497 110079_2 AA074897 AA113914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803 
AA148628 AA122204 AA074159 AA126185 AA0791 17 AA1 27089 AA070912 AA079280 AA131372 AA078833 AA071087 
AA076131 AA071047 AA079401 AA083070 AA102G76 AA115163 AA074198 AA134725 AA1 13889 AA121103 AA075041 
AA065148 AA07131G AA101 144 AA079659 AA078931 AA079209 AA070928 AA068994 AA069817 AA076187 AA069053 
AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070G53 AA126283 AA128078 AA075895 
AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AA071 110 AA079434 
AA148748 AAO79230 AA085188 AA074485 AA070580 AA076151 AA083166 AA0851 18 AA079450 AA085044 AA120938 
AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA07596a AA074563 AA084027 
AA1 15929 

124215 1597154_1 H62570 H59063 

117058 1219924.1 AW801806 H90434 BE086530 

110455 46874.1 H52576 AF085971 H52172 

111168 38585J AJ798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE1669G9 AA226824 AI829309 AW991957 N66951 AA527374 
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 
AW817659 BE081531 H59570 

111498 411008J AI168511 AIG22712 AA700366 R07371 R07324 

104340 46289_10 AA426189 F15201 

103747 117944_1 AAG81 995 AA1 01099 

134498 46501_1 M64936 A1025512 AI382987 BE061777 AA089966 BE169930 T41 176 AW594624 BE502415 AA121893 AI269283 T4031 1 

A1079277 AI241 318 BE327710 AW975215 AW896268 AA884990 BE327514 
1 18365J AA126129 AA126033 AA082561 

34624_1 AA221036 R8717G BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW858468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 
C16859 

AL042725 BE063316 AW975610 AA457591 BE082G92 AI655202 AA714296 AI267264 AI075321 AA223286 AA071122 
AA227849 AA216700 AI696002 AA101867 AA099426 AA135997 AL041698 T02815 T51824 AA207189 T59230 T51868 
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856398 AW861859 AA053045 
1 1 1 686J AAG81 507 AA070G71 AA070840 AA084362 
322947 J AI458623 AA639708 AA485409 R22065 AA485570 
1 1 3549_1 AA075124 AA075208 
22955_1 1 M55405 AW752552 

Ugr HT2959 Aa3758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 

X51363 X51364 X51365 
tigr_HT3413 L27065 

6735.9 U22951 AA203523 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AM99244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R685B4 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
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103750 
105239 



120379 34624.3 



114624 
106851 
108392 
100545 
100654 

100702 
102208 
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N70806 AI141254 A1383191 AI401237 A1080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI1 33272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 All 14729 AI061645 AI064716 AI064959 H77388 
T85705 AF075298 A!1 10799 D17107 NM.000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 
AI133290 AI133304 AI174948 AI207484 All 10717 AF074e24 All 14515 AF083516 All 10642 All 14559 All 14498 All 14759 
AI207558 AI064960 AI174753 All 14666 R69184 R0001 1 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60498 
H94399AI133158T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189T74936 AI207490 A1132925 AI064701 
AI174748 All 14663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091 136 AA334608 T5741 1 Z20979 
N56507 T87485 AI133622AA343370 T40075T69671 T53849 T74820 AF075316 AI110818T40121 T57381 A1114468 
AA332728 T51362 AI114589 R06691 A!1 10629 AF063503 AI140543 AA334661 AA33272a AA343262 T73513 T86549 
All 14840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 A1064838 AI133660 T60398 T88753 T55930 T921 26 AI444602 T60995 AI1 14792 H9391 1 A!1 331 06 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 A1133017 AI064857 All 10730 AF074637 AI207567 
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T401 63 T60528 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 A1065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 A1064778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 All 14710 T51776 AA343213 All 14714 T58102 AM 10809 R28984 A1174854 AA305675 
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T7321 1 AI114590 T48317 T55985 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 Af207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646 
T83962 A1065112 A1207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 A1114720 AI433289 
AA046980 A1823482 AI114536 AA860651 AW242644 R07469 AW300438 AM33416 AW271670 AI991363 T78943 AIB23481 
AA845518 AA719124 AA883454 T68850 T691 15 AI935509 A1150977 T62890 T71374 T68294 AI174774 T6741 1 T68318 
A1064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI06481 9 AI205880 T62895 T69430 T95111 AA025050 T73330 
W52657 T71984 769118 W92684 AI1 14850 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 A1336158 
T61821 T69457 T62900 T62912 T72917 T45885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 AI1871 1 1 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 03339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300 
T28321 T55864 



123941 


genbank„AA621529 


AA621529 


118049 


genbank_N53145 


N53145 


102800 


14782 20 AA31 3538 U88895 U88902 


104106 


AA422123_LatAA422123J 


111738 


genbanK.R26065 


R26065 


113149 


genbanK_T51588 


T51588 


113958 


genbanK.W86195 


W86195 


108335 


genbank^070500 


AA070500 


108351 


genbanKAA071193 


AA071193 


108441 


genbanle.AA079079 


AA079079 


124276 


genbank_H83465 


H83465 


101447 


entre2^M21305 


M21305 


117226 


genbank_N20468 


N20468 


133379 


genbanleAA207059 


AA207059,AA207241 


119366 


genbank_T77892 


T77892 


119528 


NOT^FOUND^entrez^W38051 W38051 


112588 


genbank_R77302 


R77302 


114449 


genbank J^020736 


AA020736 


114576 


genbanleAA065096 


AA065096 


107459 


W38002_s^at W38002_s 


130339 


genbanK.AA435746 


AA435746 
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TABLE 2: Figure 2 from BRCA 001 US 
Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 
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Pkey: Unique Eos probeset Identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

Un^enelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of nomial breast tissue to tumor 



PItey 


ExAccn 


UnigenelD 


Unigene Tittle 


Rl 


100499 


T51986 


H5.283108 


hemoglobin, gamma G 


10 


100549 


BE142019 


Hs.222056 


Homo sapiens cDNA FU1 1572 fis, clone HE 


10 


100654 


A03758 




NM_000477*:Homo sapiens albumin (ALB), m 


10 


100971 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


10 


101184 


N(V1_001674 


Hs.460 


activating transcript'on factor 3 


10 


101336 


NIVI 006732 


Hs.75678 


FBJ murine osteosarcoma viral oncogene h 


10 


101367 


X03350 


Hs.4 


alcohol dehydrogenase IB (class 1). beta 


10 


101447 


M21305 




gbiHuman alpha satellite and satellite 3 


10 


101461 


N98569 


HSJ6422 


phospholipase A2, group IIA (platelets, 


10 


101511 


IV127826 


Hs.267319 


endogenous retroviral protease 


10 


101736 


M74447 


Hs.502 


transporter 2. ATP-bindIng cassette, sub-famil 


10 


102208 


U22961 




gb:Human mRNA done with similarity to L 


10 


102450 U48251 


Hs.75871 


protein kinase C binding protein 1 


10 


102800 


AA313538 




gb:EST185419 Colon carcinoma (HCC) cell 


10 


102857 


NIVI_006744 


Hs.76461 


retlnol-binding protein 4, interstitial 


10 


102990 


AA829286 


Hs.332053 


senjm amyloid A1 


10 


103747 AA081995 




gb:zn26d06.r1 Stratagene neuroepithelium 


10 


103812 


AA137107 


Hs.326391 


Homo sapiens, clone MGC:16638, mRNA, com 


10 


104093 


R50727 


Hs.336970 


ESTs 


10 


104109 


Al^53957 


Hs.284181 


hypothetical protein DKFZp434P0531 


10 


104250 


F06638 


Hs.12440 


Homo sapiens clone 24734 mRNA sequence 


10 


104492 


N73185 


Hs.94285 


EST 


10 


104506 


N91071 


Hs. 109650 


ESTs 


10 


104532 


AI498763 


Hs.203013 


hypothetical protein FU12748 


10 


104677 


AA009764 


Hs.190380 


ESTs 


10 


104711 


AA0i7245 


Hs.32794 


ESTs 


10 


104731 


AA019300 


Hs.125070 


ESTs, Moderately similar to 154374 gene 


10 


105005 


AI298208 


Hs.28805 


ESTs 


10 


105036 


AA130390 


Hs^5549 


hypothetical protein FU20898 


10 


105239 


AA221036 




gb:zr03f12.r1 Stratagene NT2 neuronal pr 


10 


106052 


N79885 


Hs.6382 


ESTs. Highly slmHarto T00391 hypotheti 


10 


106181 


A1803651 


Hs,191608 


ESTs 


10 


106283 A1085846 


Hs.25522 


KIAA1808 protein 


10 


106379 


AL042069 


Hs. 119021 


DKFZP434N061 protein 


10 


106451 


AW235928 


Hs.313182 


ESTs 


10 


106491 


AA1 35688 


Hs.10083 


Homo sapiens, clone IMAGE:4 139786, mRNA, 


10 


106782 AW054886 


Hs.25682 


Homo sapiens mRNA for KIAA1863 protein, 


10 


107124 


AB006532 


Hs.31442 


RecQ protein-like 4 


10 


107148 


AI005036 


Hs.334305 


GS1999full 


10 


107214 


AF127026 


Hs.5394 


myosin lA 


10 


107242 AB020672 


Hs.175411 


KIAA0865 protein 


10 


107331 


Aig05985 


Hs.111805 


ESTs 


10 


107447 


W28516 


• Hs.19210 


hypothetical protein MGC1 1308 


10 


107451 


ALi)42425 


Hs.283976 


hypthetical proti^ PR02389 


10 


107872 BE271708 


Hs.95110 


ESTs. Weddy similar to A55943 1-phospha 


10 


108351 


AA071193 




gb2f79b12^1 SoarBSj)ineaUlandJJ3HPG 


10 


109546 


F01449 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fir 


10 


110433 AW294162 


Hs.301062 


UDP-N-acetyl-alpha-D-galaclosamine:polyp 


10 


110976 


AL044174 


Hs.159526 


patched prosophila) homdog 


5 
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111168 

111651 

111803 

114484 

125284 

128850 

128903 

129346 

129381 

129516 

129554 

130085 

130243 

130400 

130436 

130563 

130589 

130683 

130689 

130689 

130718 

130798 

130840 

131184 

131282 

131328 

131543 

131753 

131785 

131828 

132426 

132675 

132898 

132905 

133120 

133407 

133719 

134007 

134055 

134111 

134117 

134177 

134369 

134496 

134510 

134550 

134758 

134963 

135066 

408790 

446674 



AI798376 

R16733 

AA593731 

AA034378 

NIVL002666 

AA193106 

AW150717 

AF1 10141 

AW245805 

AF020038 

BE222078 

M62402 

D88435 

V00517 

NWL001928 

BE270472 

AL1 10226 

AA99326g 

NM^006691 

AA045747 

N70196 

M81349 

BE048821 

AB040935 

X03350 

AW939251 

AW966881 

AA829286 

H59342 

AJ000263 

AWl 18072 

AI291496 

W28548 

NNL004235 

NM„003278 

AF017987 

H26904 

AF072441 

D86Q62 

AI372588 

AA081846 

BE243319 

AF207664 

M64936 

NMJ02757 

M26315 ■ 

NM 000078 

NmI0O3394 

X04430 

AW580227 

AA563892 



Hs.20499 

Hs.325823 

Hs.267319 

Hs.1 03253 

Hs.180817 

Hs^6176 

HS288908 

Hs.1 10903 

Hs.11223 

Hs.1 13069 

Hs.274313 

Hs.153227 

Hs.283108 

Hs.155597 

Hs.279900 

Hs.16441 

Hs.17872 

Hs.17917 

Hs.17917 

Hs.18376 

Hs.1955 

Hs.20144 

Hs.23954 

Hs.4 

Hs^5647 

Hs,41639 

Hs.332053 

Hs.26320 

Hs,278658 

Hs,89981 

Hs.5476 

Hs.224829 

Hs.7934 . 

Hs.65424 

Hs.7306 

HS75736 

Hs.7840 

Hs.182423 

Hs.8022 

Hs.7921 

HS79672 

Hs.8230 

Hs.250870 

Hs.85258 

Hs.89538 

Hs.91985 

Hs.93913 

Hs.47860 

Hs.306000 



gb:tr34b07.x1 NCLCGAP_Ov23 Ham sapiens 
ESTs 

ESTs. Moderately similar to ALUS^HUMAN A 
endogenous retroviral protease 



chromosome 11 open reading frame 23 
STAT induced STAT inhtoitor 3 
WAS protein family, member 2 
daudin 5 (transmembrane protein deleted 
isodtrate dehydrogenase 1 (NADP-^-), solu 
ESTs 

insulin-like growth factor binding prote 

cyclin G associated Idnase 

hemoglobin, gamma G 

D component of complement (adipsin) 

HSPC015 protein 

DKFZP434H204 protein 

Homo sapiens, clone IMAGE:3875012. mRNA- 

extracellular link domain-containing 1 

extracellular link domain-containing 1 

KIAA1 31 9 protein 

senjm amyloid A4, constitutive 

small inducible cytokine subfamily A (Cy 

cerebral cell adhesion molecule 

alcohol dehydrogenase 1B (class I), beta 

v-fos FBJ murine osteosarcoma viral onco. 

programmed cell death 2 

serum amyloid Al 

TRABID protein 

keratin, hair, basic, 6 (monilethrix) 
diacylglycerol kinase, zeta (104kD) 
Homo sapiens, ctone IMAGE:3530123. mRNA. 
ESTs 

Knjppel-fike factor 4 (gut) 

tetranectin (plasminogen-binding protein 

secreted frizzled-related protein 1 

apolipoprotein D 

calcineurin bindmg protein 1 

ESI (zebrafish) protein, human homolog o 

.TU3A protein 

Homo sapiens mRNA; cDNA DKFZp566E183 (fr 
KIAA0652 gene product 
a dislntegrin-like and metaltoprotease ( 
gb:Homo sapiens retinoic addnnducible 
mitogen-activated protein kinase kinase 
CDS antigen, alpha polypeptide (p32) 
cholesteryl este r transfer protein, plas 
wingless-type MMTV integratfon site fami 
interleukin 6 (interferon, beta 2) 
neurotrophic tyrosine kinase, receptor, type 2 
solute carrier family 4 (anion exchange^, memb 



10 

10 

10 

10 

10 

5 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

10 

5 

5 

10 
10 
10 
10 

5 

5 

10 

10 

10 
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TABLE 2A 

Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2, 
For each probeset, we have listed the gene cluster nxxmber from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" colvunn. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene dusternumber 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



111168 . 38585J AI798376 S46400AW811617AW811616W00557BE142245AW858232AW861851 AW858362AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H68214 AW81439B AF134164 AA243093 AA173345 AA199942 AA223384 
20 AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 

AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694255 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 
25 103747 117944J AA081995 AA101099 

134496 46501^1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283T40311 

AI884569 AA25701 1 AI079277 A1241318 BE327710 AW975215 AW896268 AA884990 BE327514 
105239 34624J AA221036 R87170 BE537058 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE0B1833 BEOOD620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
30 AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 

C16859 

100654 tigr^HT2969 A03758 A06977 A15293 D17029 D17107 D17171 U)0132 L00133 IVI12523 M13075 M13076 M92816 U22961 V00494 V00495 

X51363X51364X51365 

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 

35 AI209170 AI186653 An27795 AJ183846 H77389 Ai589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 

AI685092 At093426 AI623873 AI074570 N500g6 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
N70806 AI141254 AI383191 Ai401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 A1559265 
AA877103 W84464 AA625146 R68379 A1133207 AI132g80 A)133214 AI064826 AI061615 AI133473 Ai174852 AI133404 
AI133272 V00494 1^12523 M12523 AI207526 Ai133120 AI064802 AI174993 AI114729 AI061645.AI064716 AI064959 H77388 

40 T85706 AF075298 All 10799 D17107 NMJ000477 AF190168 R50724 A!248416 AI207432 AI133684 AI133345 AI174710 

AI133290 AI133304 An74948 A\207484 AI110717 AF074624 A!114515 AF063516 AI110642 At114559 A1114498 A!114759 
AI207568 AI064960 AI174753 AI1 14666 R69184 R00011 At064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701 
AI174748 A!114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 

45 AI444620 T52290D16931 T40012T48403 T58926 T69195AI133061 T50850 AI4G0677 AI091136 AA334608 T57411 Z20979 

N56507 T87485AI1 33622 AA343370 T40075T69671 T53849T74820AF075316 All 10818 T401 21 T57381 AI114468 
AA332728T51 362 AH 14589 R06691 A11 10829 AF063503 AI140543 AA334651 AA332720 AA343262 T73513T86549 
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 All 14792 H9391 1 A1133106 

50 R10779 A1065020 T90925 T50889 D17029 AI133703 AA333805 All 33040 AI133017 AI064857 All 10730 AF074637 AI207567 

H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39652 T40136 AA334904 T71425 H77784 RQ0874 AI065049 T84512 T55918 AI207595 T39951 
AA005018 T60361 T69176 T73356 T58795 T61233 T39955 T60612 A1114676 A1064778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 All 14710 T51776 AA343213 AH 14714 T58102 All 10809 R28984 AI174854 AA305675 

55 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308.W86731 T82851 T48269 

H54053 T73211 AH 14590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AJ192792 AI768421 A1064737 AW051713 AA936693 AI1331 17 AI766232 AI913646 
T83952 A10651 12 AI207689 AI174684 AI207702 T81475 AI133325 A1032512 AA701169 AI936354 AI114720 AI433289 
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW30043B AI133416 AW271670 AI991363 T78943 AI823481 

60 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 

AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330 
W52657 T71984 T691 18 W92684 All 14860 T62093 T61797 AI522333 T73322 H92981 T56018 T6181 1 T57232 AI336158 
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T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T7131 1 T61819 T89358 T67708 T70918 
T59166AI187111 T64308 T62071 T69427AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300 
T28321 T55864 

102800 14782.20 AA313538 U88895 U88902 

108351 genbanK_AA071193 AA071193 

101447 entre^M21305 M21305 
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TABLES: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAocn: Exemplar Aooession number. Genbank aocesston number 

10 UnigenelD: Untgene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of normal breast tissue to tumor 



15 



20 



25 



Pkey ExAccn 



101336 
102208 
102990 
111168 
111803 
130085 
130840 
131543 
133120 
134758 



NM_006732 

U22981 

AA829286 

AI798376 

AA593731 

M62402 

BE048821 

AW966881 

NM_003278 

NM_000078 



UnigenelD UnigeneTitle 



Hs.75678 

Hs.332053 

Hs.325823 

Hs.274313 

Hs.20144 

Hs,41639 

Hs.65424 

Hs.89538 



Rl 

FBJ murine osteosarcoma viral oncogene h 10.0 

gb:Human mRNA clone with similarity to L 10.0 

serum amyloid A1 10.0 

gbir34b07.x1 NCLCGAP_Ov23 Homo sapiens 10.0 

ESTs. IVIoderately similar to ALU5_HUMAN A 10.0 

insulin-like growth factor binding prote 10.0 

small inducible cytokine subfamily A (Cy 1 0.0 

programmed cell death 2 10.0 

tetranectin (plasminogen-bmding protein 10.0 

chotesteryl ester transfer protein, plas 1 0.0 
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TABLE 3 A 

Table 3A shows the accession numbers for those pkeys lacking unigenelD's for Table 3. For 
each probeset, we have listed the gene cluster number from which the oUgonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland Cahfomia). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" col\mm. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



IS Pkey CAT number Accessions 

111168 38585 1 AI798376 S46400AW81 1617 AW81 1616 W00557 BE142245AW858232 AW861851 AW858362 AA232351 AA218567 

AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61 139 AA149776 AA699829 AW879188 AW813567 AW81353B A1267168 AA157718 
20 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 

AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW931957 N56951 AA527374 H56215 
AA045564 AI694285 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

1022086735 9 U22961 AA203623 AA503337 AI174733 AM 92802 006092 AA035357 AI190619 AI199244 A1828450 AA602296 AI378195 

25 " Ai209170 AI186653 AI127795 AI183846 H77389 A1589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 AI685092 

AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 A1378348 AI559265 AA877103 W84464 
AA625146 R68379 AI133207 AI132980 AI133214 A1064826 A1061615 AI133473 AI174852 A1133404 AI133272 V00494 M12523 
M12523 AI207526 A1133120 AI054802 AI174993 A1114729 AI061645 A1054716 AI064959 H77388 T85706 AF075298 AI110799 

30 D17107 NM_0G0477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 AI133290 AI133304 AI174948 

AI207484 All 10717 AF074624 All 14515 AF063516 All 10642 A11 14559 All 14498 All 14759 AI207568 AI064960 AI174753 
All 14666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 894399 AI133158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51189 T74936 A1207490 AI132925 AI0S4701 An74748 AI114663 AI133104 AI132999 
A1133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 D16931 T40012 T48403 

3 5 T58926 T69195 A1133061 T50850 AI400677 AI091 136 AA334608 T5741 1 Z20979 N56507 T87485 AI133622 AA343370 T40075 

T69671 T53849T74820 AF075316AI110818T40121 T57381 All 14468 AA33272BT51362 All 14589 R06691 AI110629 
AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 AI133660 T60398 T88753 
T55930 T92126 AI444602 T60996 All 14792 H9391 1 AI133106 R10779 AK)65020 T90925 T50889 D17029 AI133703 AA333805 

40 AI133040 AI133017 AI064857 All 10730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 

R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 All 14676 
AI064778 AA035710 W52763 AI114788 T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 
AI110809 R28984 A1174854 AA305575 AA343592 T53835 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 

45 AF075308 W86731 T82851 T48269 H54053 T73211 A)114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 

R02576 T95666 AI203974 AI189471 AA005147 AI478102 A1207662 AI192792 AI768421 AI064737 AW051713 AA936693 
• A11331 17 AI766232 AI913646 T83962 A1065112 A1207689 AI174684 AI207702 T81475 AI133325 AI032512.AA701 169 AI936354 
AI114720 AI433289 AA046980 AI823482 All 14536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 
T78943 A1823481 AA845518 AA719124 AA883454 T68850 T69115 AI935509 A!150977 T62890 T71374 T68294 AI174774 

50 T67411 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 

AA025050 T73330 W52657 T71984 T691 18 W92684 AM 14860 T62093 T61797 A1522333 T73322 H92981 T56018 T6181 1 
T57232 AI336158 T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T7131 1 T61819 T89358 
T67708 T70918 T59166 AI1871 1 1 T64308 T62071 T69427 All 14750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 

55 AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T6B447 D11809 D12412 

T64300 T28321T55864 
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TABLE 4: Figure 4 from BRCA 001 US 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbanit accession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 
R1: Ratio of tumor to normal breast tissue 

IS Pk^ ExAccn UnigenelD Unigene Title R1 

100113 NI\/L001269Hs.84746 chromosome condensation 1 2.3 

100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 D12485 Hs.11951 ectonudeotide pyrophosphatase/phosphodiesterase 1 1.9 

20 100146 BE165499 Hs.2471 KIAA0020 gene product 1.9 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasdclin Mike) (pedostin) 7.5 

100154 H60720 Hs.81892 KIAA01 01 gene product 9.2 

100163 W44671 Hs.124 gene predicted from cDNA with a complete coding setpience 1.6 

100220 AW015534 Hs.217493 annexinA2 2.0 

25 100265 D38521 Hs.112396 KIAA0077 protein 1.5 

100271 BE160081 Hs.256290 S 100 calcium-binding protein All (calgizzarin) 13.5 

100275 BE242802 Hs.154797 KIAA0090 protein 5.1 

100323 D50920 Hs.23106 KIAA01 30 gene product 1.9 

100335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase. isofonn lb, gamma subunit (29kD) 2.7 

30 100364 NIVI_004341Hs. 154868 carbamoyl-phosphate synthetase 2. aspartate transcarbamylase, and dihydfoorotase • 2.0 

100372 NM_014791Hs.184339 KIAA01 75 gene product 2.6 ' 

100393 D84145 Hs.39913 novel RGD-containing protein 3.2 

100400 AW954324 Hs75790 phosphatidylinositol gfycan, dass C 1.5 

100418 D86978 Hs.84790 KIAA0225 protein 2.0 

35 100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein A/B 2.9 

100518 NM_004415Hs74316 desmoplakin (DPI. DPII) 1.9 

100666 L05424 Hs. 169610 CD44 antigen (homing function and Indian blood group system) 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 9.0 

100668 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 7.6 
40 100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 

100685 AA328229 Hs.184582 ribosomal protein t^4 1.8 

100690 AA3832S6 Hs.1857 estrogen receptor 1 1.6 

100783 AF078847 H5.191356 general transcription factor IIH. polypeptide 2 (44kD subunit) 5.9 

100850 AA636472 Hs.297939 cathepslnB 1.7 

45 100892 BE245294 Hs.180789 S164 protein 17 

100945 AF002225 Hs.180686 ubquitin protein ligase E3A (human papilloma vims E6-associated protein, Angelman syndrome) 1.5 

1 00969 AA1 57634 Hs.79172 solute earner family 25 (mitochondrial canien adenine nucleotide translocator), member 5 6.3 

100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 

100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome ^5 reductase) 1.6 

50 101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B, 92kD getatinase. 92kO type IV collagenase) 8.2 

101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 

101077 N99692 Hs.75227 Empirically selected from AFFX single probeset 2.6 

101093 L06419 Hs.75093 procollagen-lysine. 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type Vl)1 .4 

101161 NM_006262Hs.37044 peripherin 16.9 

55 101186 AA020956 Hs.179881 core-binding factor, beta subunit 2.0 

101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK2-assoctated dual specificity phosphatase) 1.8 

101228 AA333387 Hs.82916 chaperonin containing TCP1. subunit 6A (zeta 1) 17 

101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 

101249 L18964 Hs.1904 protein kinase C. lota 1.5 

60 101332 J04088 Hs.156346 topoisomerase (DMA) « alpha (170kD) 5.2 . 

101332 J04088 Hs.156346 topoisomerase (DMA) II alpha (170kD) 3.4 

101352 A1494299 IHs.16297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 6.3 

101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 

101445 M21259 gb:Human Alu repeats in the region 5' to the small nudearrib 1.9 

65 101470 NM.000546Hs.1846 tumor protein p53 (U-Fraumeni syndrome) 1.6 
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101478 NM_002890Hs758 RAS p21 protein activator (GTPase activating protein) 1 2.5 
101483 M24486 Hs.76768 procollagen-proline. 2-oxoglutarate 4-dioxy9enase (proline 4-hydroxylase), alpha polypeptide I 55 
101540 J04977 H5.84981 X-ray repair complementing defective repair in Chinese hamster cells 5 (double-strand-break rejoining 2.1 

101573 AW248421 Hs.250758 proteasome (prasome, macropain) 26S subunit, ATPase. 3 1.6 

5 101580 NM.012151HS.83363 coagulation factor Vlll-assodated (intronic transcript) 57 

101592 AF064853 Hs.91299 guanine nudeotide binding protein (G protein), beta polypeptide 2 1.8 

101592 AF084853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 

101621 BE391804 Hs.62661 guanylate binding protein 1, interferDn^dudbIe.67kD 2.4 

101702 AW5O4089 Hs.179574 protein phosphatase 2 (fo/meriy 2A). regulatory subunit B (PR 52). alpha isoform 1.3 

1 0 101734 M74099 Hs.147049 cut (Drosophlla)-like 1 (CCAAT displacement protein) 2.1 

101759 M80244 Hs.184601 solute carrier farnily 7 (cationic amino acid transporter. y+ system), member 5 5.0 

101767 M81057 Hs.1 80884 cariboxypeptWase 81 (tissue) 14.4 

101782 AA306495 Hs.1869 phosphogiucomutase 1 5.2 

101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/Hsp90-onganizing protein) 8.6 

15 101806 AA586894 Hs.112408 S 100 calcium-binding protein A7 (psoriasin 1) 8.9 

101810 NM_000318Hs-180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3,2 

101879 AA176374 Hs.243886 nuclear autoantigenic spemri protein (histone-binding) 1.6 

101911 AA441787 Hs.1 19689 glycoprotein honnones, alpha polypeptkle 31.3 

101920 AF182645 Hs,8024 IK cytokine, down-regulator of HLA I! 1.8 

20 101973 U41514 Hs.80120 lJDP-N-acetyl-alpha-D.galadosamine:polypeptide N-acetyfealactosaminyltransfefase 1 ( 2.4 

101983 AI904232 Hs.75323 prohibitin 8.4 

102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 

102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevisiae, homolog) 2.0 

102083 T35901 Hs75117 interleukin enhancer binding factor 2. 45kD 1.6 

25 102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.3 

102107 BE258602 Hs.182366 heat shock protein 75 . 1.4 

102123 NM_001809Hs.1594 centromere protein A (17kD) 1.8 

102165 BE313280 Hs.159627 death associated protein 3 4.6 

102198 AW950852 Hs.74598 polymerase (DMA directed), delta 2. regulatory subunit (50kD) 4.3 

30 102217 AA829978 Hs.301613 JTV1 gene 6.7 

102220 U24389 Hs.65436 lysosomal 4.3 
102234 AW163390 Hs.278554 heterochromatin-like protein 1 1-9 

102260 ALO39104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4 

102302 AA306342 Hs.69171 protein kinase C-like 2 27 

35 102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 1.5 

102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 

102348 U37519 Hs.87539 aMehyde dehydrogenase 3 femily, member B2 2.0 

102349 AU077055 Hs.289107 bacubvlral lAP repeat-containing 2 3.2 
102369 U39840 Hs.2g9867 hepatocyte nuclear factor 3. alpha 2.0 

40 102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 
102391 AA296874 Hs.77494 deoxyguanosine kinase 1-5 
102455 U48705 Hs.75562 discoidin domain receptor family, member 1 6.9 
102465 NM_001359Hs.81548 2.4-dienoyl CoA reductase 1 , mitochondrial 1.8 
102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1. 59kD 1.5 

45 102489 AL080116 HsJ4420 origin recognition complex, subunft 3 (yeast homotog)-Iike 3.3 
102494 A1188137 Hs75193 C0P9 homotog 2.1 
102501 AF217197 Hs74562 siah binding protein 1; FBP interacting represson pyrimidine tract binding splicing 3.2 
102522 BE250944 Hs.1 83556 solute carrier family 1 (neutral amino acid transporter), member 5 2.8 
102532 AF040253 Hs70186 suppressor of Ty (S.cerevisiae) 5 homolog 5.7 

50 102564 U59423 Hs79067 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3 
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 

102580 U60808 Hs.1 52981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 2.1 . 

102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) honwlog 2 ^ 1.6 

102582 U61232 Hs.32675 tubulin-specific chaperane e 2.1 
55 ■ 102617 AW161453 Hs.198757 C0P9 (constitutive photomorphogenic. ArabkJopsis. homotog) subunit 5 1.8 

102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 
102627 AL021918 Hs.158174 zinc finger protein 184 (KnippeWike) 1-3 
102663 NM_002270Hs.168075 karyopherin (irhportin) beta 2 1-8 
102676 BE262989 Hs.12045 putative protein 2.3 
60 102687 NF4.007019HS.93002 ubkiuitin carrier protein E2^ 4.3 
102689 U96132 Hs.171280 hydroxyacyM:k)enzyme A dehydrogenase, type 11 6.0 
102696 BE540274 Hs.239 fortchead box Ml 4.2 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.5O002 small indudbte cytokine subfamily A (Cys-Cys), member 19 2.3 
65 102750 AB014460 Hs.66ig6 nth (E.C0II endomidease lll)-fike 1 1.2 

102801 BE252241 Hs.38041 pyridoxal (pyridoxine. vitamin B6) kinase 6.4 
102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-like 3 1.6 
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102827 BE244588 Hs.6456 chaperonin containing TCP1. subunit 2 (beta) 5.6 

102831 AA262170 Hs.80917 adaptor-related protein complex 3. sigma 1 subunit 2.0 

102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 

102868 X02419 Hs77274 plasminogen activator, urokinase 4.4 

5 102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 

102935 BE561850 Hs.80506 small nuclear ribonudeoprotein polypeptide A' 2.4 

102968 AU076611 Hs.154672 methylene tetrahydrofo!ate deh^rogenase (NAD-i- dependent), methenyltetrahydrofolate cydohydrote^^ 

102983 BE387202 Hs.1 18638 non-metastatic cells 1. protein (NM23A) expressed In 3.1 

1029B5 U95742 Hs.2707 G1 to S phase transiUon 1 5.2 

10 103023 AW500470 Hs.1 17950 multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 1.6 

103038 AA926960 Hs.334883 CDC28 protein kinase 1 2.5 

103060 NM_005940Hs.155324 matrix metalloproteinase 11 (MMPII; stromelysin 3) 4.5 

. 103080 AU077231 Hs.82932 cydin D1 (PRAD1: parathyroid adenomatosis 1) 3.1 

103089 D31 152 Hs.1 79729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasfe) • 2.4 

15 103177 BE244377 Hs.48876 famesyl-diphosphate famesyltransferase 1 3.5 

103178 AA205475 Hs.275865 nTjosomal protein SI 8 9.9 

103179 NM_001777Hs.82685 CD47 antigen (Rh-related antigen, integrin-assodated signal transducer) 1.3 
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, partial cds 2.0 
103185 NM_006825Hs.74368 transmembrane protein (63kD), endoplasmic reticuIum/Golgi intemiediate compartment 1.6 

20 103191 AA401039 Hs.2903 protein phosphatase 4 (formerty X), catalytic subunit 2.5 

103193 NM.004766HS.75724 ooatomer protein complex, subunlt beta 2 (beta prime) 2.2 

103194 NWL004939HS.785B0 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
103223 BE275607 Hs.1708 chaperonin containing TCP1. subunlt 3 (gamma) 3.0 

25 103232 X75g62 Hs.129780 tumornecrosistector receptor superbly, member 4 1.8 

103238 Ai369285 Hs.75189 death-assodated protein 5.6 

103297 NM_001545Hs.9078 immature coton cardnoma transcript 1 1.9 

1 03330 AI803447 Hs.77496 small nudear ribonudeoprotein polypeptide G 2.5 

. 103349 X89059 gb:H. sapiens mRNA for unknown protein expressed in macrophage 1.6 

30 103375 AL036166 Hs.323378 coated vesicle membrane protein 1.8 

103391 X94453 Hs.il4366 pyrroIine-5-cart)oxylate synthetase (glutamate gamma-semiakJehyde synthetase) 2.3 

103392 X94563 gbrH.sapiens dbi/acbp gene exon 1 & 2, 4.0 
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeasQ homolog A 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 

35 103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-binding protein 2 beta) 5.1 

103547 A1376722 Hs.1 80062 proteasome (prosome. macropain) subunit, beta type, 8 (large rnultifunctional protease 7) 9.7 

103588 NM_006218Hs.857O1 phosphoinositide-3-kinase. catalytic, alpha polypeptide 2.0 

103613 NM_000346Hs.231 6 SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) 1 .3 

103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 2.0 

40 103622 AA609685 Hs.278672 membrane component, chromosome 11, surface marker 1 2.3 

103727 A1878883 Hs.296381 growth factor receptor-bound protein 2 1.3 

103749 AL135301 Hs.8768 hypothetical protein FU 10849 1.8 

103754 A101570g Hs.172089 Homo sapiens mRNA; cDNA DKFZp586l2022 (from done DKFZp586l2022) 1.3 

103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 7.5 

45 103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA. partial cds . 1.2 

103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA done 5* snnilar 1 .5 

103813 AI042582 Hs.181271 CGI-120 protein 1.5 

103655 W02363 Hs.302267 hypothetical protein FU10330 1.5 

103886 AK001278 Hs. 105737 hypothetical protein FU10416 similar to constitutive photomorphogente protein 1 6.5 

50 104052 NM_002407Hs.97644 mammaglobin2 -2.9 

104079 AA251242 Hs.103238 ESTs 1.4 

104174 AA478984 Hs.6451 PRO0659 protein 5.6 

104227 AB002343 Hs.98938 protocadherin alpha 9 1.6 

104275 A1751970 Hs.101067 GCN 5 (general control of amino-add synthesis, yeast, homotog)-like 2 5.4 

55 104325 BE379765 Hs.150675 polymerase (RNA) II (DNA direded) polypeptide K (7.0kD) 6.3 

104370 AA324597 Hs.21851 Homo sapiens cDNAFLJ12900fis, done NT2RP2004321 1.6 

104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 

104482 AB037762 Hs.44268 myelin gene expression factor 2 1.2 

104532 A1498763 Hs.203013 hypothetical protein FU 12748 2.1 

60 104563 AL1 17403 Hs.306189 DKFZP434F1735 protein 1.2 

104667 Ai239923 Hs.30098 ESTs 1.3 

104757 A1694413 Hs.332649 olfactory receptor, family 2. subfamily I, member 6 2.3 

104804 AI858702 Hs.31803 ESTs. Weakly simflar to N-WASP [H,sapiensl . 1.3 

104806 AB023175 Hs.22982 KIAA0958 protein 2.3 

65 104827 AW052005 Hs,8551 PRP4/STKAA/D splidng factor 10.9 

104846 AI250789 Hs.32478 ESTs 5.6 

104854 AA041276 Hs.154729 3-phosphoinosltide dependent protein kinase-1 . 12.3 
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104867 AA278898 Hs.225979 hypometical protein similar to small G proteins, especially RAP-2A 2.0 

104871 T78044 Hs.28893 Homo sapiens mRNA; cONA DKFZp56402364 (from done DKFZp56402364) • 1.3 

104896 AW015318 Hs.23165 ESTs 17.7 

104909 AW408164 Hs.249184 transcription factor 19 (SCI) 5.0 

104916 AW958157 Hs.155489 NS1 -associated protein 1 1.7 

104919 AA02688Q Hs.25252 prolactin receptor 1-4 

104930 AF043467 Hs.32893 neurexophilin 2 2.2 

104973 NM_015310Hs.6763 KIAA0942 protein 5,0 

104974 Y12059 Hs.278675 bromodomaln-containing 4 1-4 

104975 AL1 36877 Hs.50758 SMC4 (structural maintenance of chromosomes 4» yeastHite 1 2.4 
104975 AL1 36877 Hs.60758 SMC4 (structural maintenance of chromoso 2.3 

104978 All 99268 Hs.19322 Homo sapiens. Similar to I^KEN CDNA2010317E24 gene, clone 1MAGE:3502019. mRNA. parti^^ 

104979 AA937934 Hs.321062 ESTs H 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 3.5 
105009 BE379584 Hs.34789 dolichykiiphosphooBgosaccharide-protelnglycosyllransferase 5.5 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3 
105028 AI050715 Hs.2331 E2F transcription factor 5. p130-binding 2.2 
105032 AA127818 gb:zl12a02.s1 Soares4)regnanLutenJS_NbHPU Homo sapiens cDNA done IMAGE:501674 3' 6,8 
105039 AA907305 Hs.36475 ESTs 2.5 
105041 AB037716 Hs.26204 KIAA1 295 protein 2.2 
105045 BE242899 Hs.129951 speckle-type POZ protein 3.8 
105079 AA151342 Hs.12677 CGM47 protein 9-5 

105087 AA147884 Hs.9812 Homo sapiens cDNA aJI 4388 fis, done HEMBA1002716 5.6 

105088 H58589 Hs.35156 Homo sapiens cDNA FU 11 027 fis. clone PLACE1 0041 14 2.2 
105095 Z78407 Hs.27023 veside transport-related protein 2.2 
105110 BE387350 Hs.33122 K(AA116Q protein 2*f 

105126 AW975433 Hs.36288 ESTs °'3 

105127 AA045648 Hs.301957 nudix (nudeoside diphosphate linked moiety X)-type motif 5 2.1 
105141 AA164687 Hs.177576 mannosyl(alpha-1,3-)-gIycoproteinbeta-1,4-N-acetylglucosaminyltransferase, isoenzyme A 27 
105158 AW976357 Hs.234545 hypothetical protein NUF2R 1-9 
105169 BE245294 Hs.180789 S164 protein y 
105186 AA191512 Hs.28005 Homo sapiens cDNAFUl 1309 fis. done PLACE1010076 4.8 
105254 AA071276 Hs.19469 K1AA0859 protein 1-9 
105281 AA263143 Hs.24595 RADSI-interacting protein 2.8 
105288 N99673 Hs.3585 ESTs. Weakly similar to AF1 26743 1 DNAJ domain-containing protein MCJ (H.sapiensl 1.9 
105302 AA700122 Hs.3355 sentrin-spedfic protease 5*2 
105331 AW270037 Hs. 179507 KIAA0779 protein l-^ 
105359 NM 016015HS.8054 CGl-68 protein 

105366 BE264645 Hs.282093 hypothetical protein FU2191 8 5,0 

105373 AW887701 Hs.32356 hypothetical protein FU20628 2.5 

105374 BE242803 Hs.262823 hypothetical protein FU10326 2.2 
105387 AW592146 Hs.108636 membrane prot^ CHI 2.3 
105393 AF167570 Hs256583 Interleukin enhancer binding factor 3. 90kD 5.4 

105399 BE3B6877 Hs.334811 Npw38-bindirig protein NpwBP 1-^ 

105400 AF198620 Hs.65648 RNA binding motif protein 6A 1-6 ^ 
105445 AA252395 gb:zs12g10.s1 NCI_CGAP^GCB1 Homo sapfens cDNA done IMAGE:685026 3\ mRNA sequence. 5.0 
105507 BE268348 Hs.226318 CCR4-N0T transcription complex, subunit 7 1.6 

105529 AA1 13449 Hs.32471 hypothetical protein FU20354 1-3 

105530 AB023179 Hs.9059 KIAA0962 protein 3.4 
105547 AA262640 Hs.27445 unknown 9.3 
105564 BE616694 Hs.288042 hypothetical protein FU 14299 1;4 

105596 AA579535 Hs.18490 hypothetical protein FU20452 10-9 

105597 AF054284 Hs.334826 spVidng factor 3b. subunit 1, 155kD 2.9 
105608 AI808201 Hs.287863 hypothetical proteb FU 12475 1-7 
105610 AA280072 Hs.99872 fetel Alzheimer ant^en A 
105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 1.7 
105620 AW302245 Hs.181390 casein kinase 1, gamma 2 5.5 
105658 AA985190 Hs.246875 hypothetical protein FU20059 9.4 
105697 AW499988 Hs.27801 zinc finger protein 278 2.0 
105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564M0264) 1.7. 
105743 BE246502 Hs.9598 sema domain, immunoglobulin donrain (Ig). transmembrane domain (TM) and short 2.6 
105746 AW151952 Hs.46679 hypothetical protein FU20739 1-5 
105759 AI123118 Hs.15159 chemokine-fike fador. alternatively spUced 1.3 
105771 AI267720 Hs,153221 synovial sarcoma, transkxated to X chromosome 1.6 
105820 AA741336 Hs.152108 transcriptional unit N143 2-2 
105826 AA478756 Hs.194477 E3 ubiquifin Hgase SMURF2 1-3 
105856 AI262106 Hs.12653 ESTs 2.4 
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105858 AF151086 Hs.281428 hypotheUcal protein 2,9 

105875 AK001708 Hs.32271 hypothetical protein FU 10846 1.4 

105930 AF016371 Hs.9880 peptidyi prolyl isomerase H (cyclophiDn H) 5.2 
108000 AW194426 Hs.20726 ESTs •1.7 

5 106011 AW081202 Hs.12284 Homo sapiens, clone IMAGE:2989556. mRNA. partial cds 2.8 

106017 AA477956 Hs.26268 ESTs 1.4 

106073 AL157441 Hs.17834 downstream neighbor of SON 1.4 
106078 AA130158 Hs.19977 ESTs. Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

106094 AA533491 Hs.23317 hypothetical protein FU1 4681 6.8 

10 106140 AB006624 Hs.14912 KiAA0286 protein 1.6 

106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 5430429M05 gene, clone.MGG:13i55. mRNA, complete cds 10.8 

105288 AB037742 Hs.24336 KIAA1 321 protein 1.3 

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs.22410 ESTs, Weakly similar to A54849 collagen alpha 1(VII)<^aln precursor [H.sapien 5.4 

15 106350 AK001404 Hs.194698 cydinB2 5.7 

106359 AW390282 , Hs.31 130 transmembrane 7 superfamily member 2 6.3 

106381 AB040916 Hs.24106 KIAA1 483 protein 6.5 

106389 AW748420 Hs.6236 Homo sapiens cDNA: FU21487 fis. clone COL05419 2.2 

106457 AF1 19256 Hs.27801 zinc finger protein 278 2.7 

20 106470 D63078 Hs.186180 Homo sapiens cDNA: FU23038 fis, clone LNG02039 2.3 

106531 AA454036 Hs.8832 ESTs 1.6 

106586 . AA243837 Hs.57787 ESTs 1.6 

106589 AK000933 Hs.28661 Homo sapiens cDNAFU 10071 fis. done HEMBA1001702 2.4 

106610 AA458882 Hs,79732 fibulini 7.9 

25 106624 NM.003595HS.26350 tyrosylprotein sulfotransferase 2 7.7 

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from done DKFZp564O0122) 1.6 

106669 AV657117 Hs.184164 ESTs. Moderately similar to S65657atpha-1C-adrenergicieceptorspnce form 2 [H^apiens^ 1.3 

106713 BE614802 Hs.184352 hypothetical protein FLJ 12549 4.5 

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-assodated RNA-Wnding protein 1.3 

30 106723 BE388094 Hs.21857 ESTs 1.6 

106795 AF1 74487 Hs.293753 Bd-2-related ovarian killer protein-like 5.7 

106829 AW959893 Hs.27099 hypotheUcal protein FU23293 similar to ARL-6 interacUng protein-2 16.2 

106831 BE564871 Hs.29463 centrin. EF-hand protein. 3 (CDC3i- yeast homolog) 1.5 

106846 AB037744 Hs.34892 KIAAI 323 protein 2.2 

35 106852 AF151031 Hs.300631 hypothetical protein 1.3 

106873 N49809 Hs.11197 Homo sapiens, clone 1MAGE:3343149, mRNA. partial cds 16.8 

106886 W79171 Hs.9567 GL002 protein 1.5 

106906 AA861271 Hs.222024 transcription factor BMAL2 2.2 

105920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3 

40 106945 AK00051 1 Hs.6294 hypotheUcal protein DKFZp434L1435 similar to valyl tRNA synthetase 6.8 

106973 BE156256 Hs.11923 hypothetical protein 6.6 

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8 

106978 AW631480 Hs.8688 ESTs 6.0 
107004 AA146872 Hs.300700 hypothetical protein FLJ20727 1.3 

45 107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8 

107071 AW385224 Hs.35198 ectonudeotide pyrophosphatase/phosphodiesterase 5 (putative function) 1.7 

107113 AK000733 Hs.23900 GTPase adivating protein 2.5 

' 107125 AK000512 Hs.69388 hypothetical protein FU20S05 1.7 

107136 AV66ig58 Hs.8207 GK001 protein 4.6 

50 107136 AV661958 Hs.8207 GK001 protein 3.3 

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0 

107151 AW378065 Hs.8687 ESTs 6.3 

107155 AW391927 Hs.7946 KIAAI 288 protein 33.5 

107174 BE122762 Hs.25338 ESTs 5.2 

55 107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1 

107221 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 (stathmin) 17.4 

107243 BE219716 Hs.34727 ESTs, Moderately siniilar to 138759 zinc finger/leudne zipper protein IH.sapiens] 7.4 

107248 AW263124 Hs,315111 nudear receptor co-fepressor/HDAC3 complex subunit 1.8 

107263 D60341 Hs21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6 

60 107265 BE379594 Hs.49136 ESTs. Moderately similar to ALU7_HUMAN ALU SUBFAMILY SO SEQUENCE CONTAMINATION 2.5 

107298 N95657 Hs.6820 ESTs. Moderately similar to Y0J1 ^.CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5 

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJIjCAEEL H 1.7 

107299 BE277457 Hs.30661 hypothetical protein MGC4606 ' 3.2 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from done OKFZp586I0324) 2.0 

65 107354 NM^006299Hs.96448 zinc finger protein 193 5.0 

107392 AW299900 Hs.267632 TATA element modulatory factor 1 1.2 

107481 AA307703 Hs.279766 kinesin family member 4A 1.6 
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107529 BE515065 Hs^96585 nucleolar protein (KKE/D repeat) 3.0 

107554 AA001385 Hs.59844 ESTs 1-3 

107681 BE379594 Hs.49136 ESTs. Moderatety slmflar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 22 

mm AA018587 Hs.303055 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

107859 AVV732573 Hs.47584 potassium vdtage-gated channel, delayed-reclifier, subfamily S. member 3 8.4 

107901 L42612 Hs.335952 keraUn6B 2.5 

.107901 L42612 Hs.335952 keratin 6B 1-6 

107922 BE153855 Hs.6l460 Ig superfamily receptor LNIR 2.2 

107974 AW956103 Hs.61712 pynivate dehydrogenase Idnase. isoenzyme 1 6*7 

108040 AL121031 Hs.159971 SWI/SNF related, matia asso(feted. actin dependent regulator of chromatin, s^^^ 1.5 



Hs.9071 



108647 BE546947 Hs.44276 

108695 AB029000 Hs.70823 

108717 AA122393 Hs.70811 

108740 AI089575 

108828 AK001693 

108859 AL121500 

108872 H06720 

108891 At801235 

108894 AK001431 



109002 AB028987 Hs72134 

109011 AA156542 Hs.72l27 

109026 AA1 57811 

109068 AA164293 Hs.72545 



1.3 
7.1 
2.5 
3.5 
3.4 
1.6 
1.7 
9.8 
7.2 
1.3 
2.7 
1.8 
1.5 
2.1 
5.3 
4.0 
5.6 
1,6 
6.2 
1.7 
1:4 



108230 AA054224 Hs.59B47 ESTs 
108274 AF129535 Hs.272027 F-box only protein 5 
108296 N31256 Hs.161623 ESTs 
108495 AAb83059 Hs.339659 ESTs 

108607 BE300380 Hs.69476 Homo sapiens cDNA FLJ12758 fis. done NT2RP2001328 
108621 AA101809 Hs.182685 ESTs 
108634 AW022410 Hs.69507 ESTs 

homeo box CIO 
KIAA1077 protein 
hypothetical protein FLJ20516 
progesterone membrane binding protein 
Hs.273344 DKFZP564O0463 protein 
Hs.178904 ESTs 
Hs.11 1680 endosulRne alpha 
Hs.48480 ESTs 
Hs.5105 hypothetical protein FLJ10569 

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 
108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 
108987 AA152178 Hs.23467 hypothetical protein FIJI 0633 
K1AA1064 protein 
ESTs 

gb:zo35d07.s1 Stratagene coton (937204) Homo sapiens cDNA ctone 3' similar to contains Alu repetitive 
ESTs 2.9. 
109101 AW608930 Hs.52184 hypothetical protein FLJ20618 ^-6 
109112 AW419196 Hs.257924 hypothetical protein FLJ13782 3.2 
109124 AKD00684 Hs.183887 hypothetical protein FLJ22104 
109139 AJ132592 Hs.59757 zinc finger protein 281 
109166 AA219691 Hs.73625 RAB6 interacting, kinestn-iike (rabkinesin 6) 
109198 BE566742 Hs.58169 highly repressed in cancer, ndi in leucine heptad repeats 
109213 NM-016663HS.82035 potential nuclear protein C50RF5; GAP-like protein 
109220 AW958181 Hs.189998 ESTs 
109233 AU077281 Hs.170285 nudeoportn 214kD (CAIN) 

109270 N99673 Hs.3585 ESTs. Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ IH.sapiens) 
109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNADKFZp586F 1822 (from done DKFZp586F1 822) 
109313 AF153201 Hs.86276 C2H2 (KruppeMype) zinc finger protein 
109341 AA213508 Hs.1 15099 EST 

109391 AL096858 Hs.l84245 KIAA0929 protein Msx2 interacting nuclear target (Ml NT) homolog 
109420 H83603 Hs.40408 homeo box C9 
109426 N30531 Hs.42215 protein phosphatase 1. regulatory subunit 6 
109429 AI160029 Hs.61438 ESTs 
109445 AA232103 Hs.189915 ESTs 
109450 AB032969 Hs.173042 K1AA1 143 protein 
109468 NM 015310HS.6763 KIAA0942 protein 
109478 AW074143 Hs.87134 ESTs 

Hs.1 18890 glycogen synthase kinase 3 alpha 
Hs.27319 ESTs 
Hs.16798 ESTs 

Hs.21907 histone acetyllransferase 
Hs.279009 matrix Gla protein 
K1AAO460 protein 
ESTs 

Hs.226429 ESTs. WeaWy similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
110154 NM.014521HS.17667 SH3-domain binding protein 4 

110240 AI668594 Hs.176588 ESTs, WeaWysimlfer to CP4Y.HUMAN CYTOCHROME P4504A11 PRECURSOR IH.sapiensl 
110242 N41744 Hs.19978 CGWO protein 

110259 H28428 Hs.32406 ESTs. Weakly similar to I38Q22 hypothetical protein (H.sapiens] • 
110312 BE256986 Hs.11896 hypotheScal protein FLJ12089 
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109570 L40027 
109662 F02614 
109825 R71264 
110039 HI 1938 
110056 AA503041 
110085 AA603840 Hs.29956 
110110 T07353 Hs.7948 
110129 R51853 



2.6 
2.9. 
2.0 
5.3 
57 
5.3 
1.4 
2.9 
1,3 
25 
1.5 
2.2 
3.0 
1.9 
1.8 
3:7 
3,2 
2.0 
2.1 
1.4 
1.3 
2.0 
Z5 
1.7 
2.9 
17 
4.2 
A2 
1.3 
2.2 
2.1 
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110330 Ai288666 Hs.16621 DKFZP43411 16 protein 6.2 

110501 H55748 gb:yq94a01 .si SoaresfetaMiver spleen INFLSHoitk) sapiens cDNA clone IMAGE:203^ 6.1 

110504 H55915 Hs.2l0859 hypothetical protein FU 11 01 6 6.1 

110525 H57330 Hs.37430 EST 6.3 

5 110568 AK001160 Hs.5999 hypothetical protein FLJ 10298 1.3 

110699 T97586 Hs.l8090 ESTs 1.8 

110705 AB007902 Hs.32168 KIAA0442 protein 1,6 

110742 AW1 90338 Hs.28029 hypothetical protein MGC1 1256 7.6 

110751 AL138077 Hs.16157 hypothetical protein RJ 12707 2.5 

10 110762 BE044245 Hs.30011 hypothetical protein MGC2g63 9.3 

110765 AK000322 Hs.i8457 hypothetical protein aJ20315 5.5 

110769 BE000631 Hs^3B37 Homo sapiens cONAnJ11812fis: done HEMBA1008364 2.1 

110799 AlOSgaeO Hs.323401 dpy-30>iike protein 1.5 

110805 T25829 Hs24048 FK506 binding protein precursor 6.6 

15 110813 AA767373 Hs.35669 ESTs. Moderately simnartoALU1.HUA^ ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7 

110820 R33261 Hs.6614 ESTs, Weakly sirnilar to A43932 mucin 2 precursor, intestinal [H^apiens] 3.4 

110840 N31598 Hs.12727 hypothetk:al protein FU21610 1.7 

110844 AI740792 Hs.167531 methytcrotonoyl-Coenzynie A carboxylase 2 (beta) 1.7 

. 110854 BE612992 Hs.27931 hypothetical protein FLJ10607.slmitar to glucosamine-phosphate N-aoetyltransferase 4.7 

20 110856 AA992380 gb:ol37906.s1 SoaresJestis_NHT Homo sapiens cD^IA done 3' simBar to contains element 2.3 

110885 BE384447 Hs.16034 hypothetical protein MGC1 31 86 3.5 

110897 AL117430 Hs.6880 DKFZP434D1 56 protein 2.2 

110915 BE092285 Hs.29724 hypothetical protein FLJ 131 87 2.6 ' 

110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [H.saplens] 1.9 

25 . 110958 NM_005864Hs.24587 signal transdudion protein {SH3 containing) 6.7 

110963 AK002180 Hs.11449 DKFZP56401 23 protein 2.0 

110981 AK001980 Hs.24284 ADP-ribosyltransferase (NAD+; poly(ADP-ribose) pblymerase)-like 2 1.3- 
1 1 0984 AW6 1 3287 Hs.801 20 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acelylgalactosamlnyllransferase 1 (GalNAc-TI ) 1 .8 

111125 N63823 Hs.269115 ESTs. Moderately similar to Z195^HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3.6 

30 111132 AB037807 Hs.83293 hypothetical protein 2.1 

111164 N46180 Hs.122489 Homo sapiens cDN A FLJ 13289 fis. clone 0VARC1 001 170 2.3 

111172 R67419 Hs.21851 Homo sapiens cDN A FU 12900 fis, done NT2RP2004321 3.7 

111174 AL050166 Hs.26295 Homo sapiens mRN A; cDNADKFZp586D1 122 (from done DKF2p586D1 122) 7.5 

111179 AK000136 Hs.10760 asporin (LRR dass 1) 7.1 

35 111184 AI815486 Hs.243901 Homo sapiens cDN A FLJ 2073 8 fis, clone HEP08257 6.7 

111184 AI815485 Hs.243901 Homo sapiens cDNA FLJ20738 fis. clone HE 3.3 

111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiensl 3.6 

111216 AW139408 Hs.152940 ESTs 1.5 

111221 AB037782 Hs.15119 KIAA1 361 protein 2.6 

40 111223 AA852773 Hs.334838 KIAA1 866 protein 4.6 

111239 N90956 Hs.17230 hypothetical protein FU22087 7.9 

111285 AA778711 Hs.4310 eukaiyotfc translation initiation fador 1 A 6.9 

111299 AB033091 Hs.74313 KIAA1 265 protein 5.0 

111312 A1523913 Hs.34504 ESTs 3.8 

45 111318 T99755 Hs.334728 ESTs 1.2 

111337 AA837396 Hs.263925 LIS 1 -interacting protein NUDE 1, rat homolog 5.1 

111352 H58589 Hs.35156 Homo sapiens cDNAFLJ1 1027 fis, done PLACE10041 14 2.2 

111370 AI478658 Hs.94631 brefeldin A-inhibited guanine nudeotide-exchange protein 1 2.8 

111384 N94606 Hs.288969 HSCARG protein 2.2 

50 111389 AK000987 Hs.169111 oxidation resistance 1 2.1 

1 1 1391 NM_003896Hs.225939 siatyltransferase 9 (CMP>NeuAc:iactosylceramide a!pha-2.3-sialyitFansferase; GM3 synthase) 5.1 

111392 W46342 Hs.325081 Homo sapiens, done IMAGE:3659S80. mRNA, partial cds 8.4 
111452 R02354 Hs.15999 ESTs 27 
111486 AI051194 Hs.227978 EST 6.5 

55 1 1 1 549 W90638 Hs.20321 ESTs. Moderately similar to ZRF1_HUMAN ZUOTIN REUTED FACTOR-1 (M-PHASE 1 .4 

111685 R10720 Hs.20670 EST 1.6 

111627 R52656 Hs.21691 ESTs 1.6 

111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA141 3 protein, partial cds 2.4 

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D01 6 (from done DKFZp564D01 6) 10.6 

60 111944 AW083791 Hs.21263 suppressor of potassium transport defed 3 6.6 

111987 NM.015310Hs.6763 KIAA0942 protein 5.1 

112134 R41823 Hs.7413 ESTs; calsynlenin-2 2.8 

112244 AB029000 Hs.70823 KIAA1077 protein 14.6 

112388 R46071 HS.3D1693 Homo sapiens, done 1MAGE:3638994. mRNA, partial cds 9.0 

65 112456 NM_016248Hs.232076 A kinase (PRKA)andior protein 11 1.4 

112464 AW007287 Hs.28538 Homo sapiens cDNA: FLJ21086 fis, done CAS03272 1 4 

112506 Ar742756 Hs.26079 ESTs 3.2 
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112513 R68425 Hs.13809 hypothetical protein FU10S48 2.0 

112752 AK001635 Hs.14838 hypothetical protein FU 10773 1.8 

112884 AK000004 Hs.5013 Homo sapiens mRNA for FU00004 protein, partial cds 6.6 

112923 T10258 Hs.5037 EST 1.5 

5 112936 AW970826 Hs.6185 K1AA1 557 prolan 3.2 

112958 R61388 Hs.6724 ESTs 6.0 

112966 Z44718 Hs.102546 glucocorticoid receptor DNA binding factor 1 6.4 

112978 AK000272 Hs.7099 hypothetical protein FU20265 1.2 

112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357ATYKI protein (M.nuisculus) 5.6 

10 112996 BE276112 Hs.7165 zinc finger protein 259 2.0 

113047 AI571940 Hs.7549 ESTs 1.9 

1 13049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein, partial cds 2.4 

113089 T40707 Hs.270862 ESTs 1.3 
1 13196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA done IMAGE:74668 3', 1.7 

1 5 1 13248 T63857 gb:yc16e01 .si Stratagene lung (937210) Homo sapiens cDNA clone 3', mRNA sequence 2.8 

113254 AK002180 Hs.11449 DKFZP564 01 23 protein 1.3 

113277 AW971049 Hs. 11774 protein (peptidyl-pralyl cis/trans isomerase) NII\/IA-inleracting. 4 (pawulin) 3.2 

113429 AA688021 Hs.179808 ESTs 1.2 

113499 AI467908 Hs.8882 ESTs 5.9 

20 113547 H59588 Hs.15233 ESTs 2.0 

113554 AW503990 Hs.142442 HP1-BP74 3.6 

113647 AA813887 Hs.188173 Homo sapiens CDNA FU12l87fis. done MAWIMA1 000831 1.3 
1 13702 T97307 gb:ye53h05.sl Soares fetal fiver spleen INFLS Homo sapiens cDNA done IMAGE:121497 3'. 4.4 

113722 AV653556 Hs.184411 albumin 1.3 

25 1 13759 AW499665 Hs.9456 SWI/SNF related, matrix assodated, adin dependent regulator of chromatin, subfamily a, member 5 1 .2 

113777 BE266947 Hs.10590 zincfinger protein 313 13.4 

113783 Al^59588 Hs.7041 hypothetical protein DKFZp762B226 1.7 

113791 AI269096 Hs. 1 35578 chitobiase. di-N-acetyl- 1.3 

1 13808 W44735 Hs.9286 Homo sapiens cDNA: FU21278 fis, clone COL01832 3.3- 

30 113811 BE207480 Hs,6994 Homo sapiens cDIMA: FU22044 fis. done HEP09141 3.1 . 

113817 H13325 Hs.332795 hypothetical protein DKFZp76 101 7121 " 3.2 

113826 AW378212 Hs.24809 hypothetical protein FU 10826 2.3 

113834 T26483 Hs.6059 EGF-containIng fibulin-like extracellular matrix protein 2 11.3 

113868 W57902 Hs.90744 proteasome (prosome. macropain) 26S subunit, non-ATPase, 1 1 2.7 

35 113870 AL079314 Hs.16537 hypothetical protein, similar to (U05944) PRAJA1 6.1 

113885 AW959486 Hs.21732 ESTs 6.6 

113923 AW953484 Hs.3849 hypothetical protein FU22041 stnularto FK506 binding proteins 1.9 

113989 W87544 Hs266828 ESTs U 

114022 AI539519 Hs.120g69 Homo sapiens CDNA FU1 1562 fts. done HEMBA1003197 5.4 

40 114030 AI825386 Hs.164478 hypotheiicat protein FU21 939 similar to 5-azacytidine induced gene 2 9.4 

114060 AB029551 Hs.7910 RING1 and YY1 binding protein 1.8 

114196 AF017445 Hs.150926 fucose-1 -phosphate guanylyltransferase 1.5 

114226 AB028968 Hs.7989 KIAA1045 protein 1.8 
114253 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger protein 136 (done pHZ-20), done MGC:1 0647, mRNA, complete cds 2.3 

45 114262 AL1 17518 Hs.3586 KIAA0978 protein 1.4 

114275 AW515443 Hs.306117 KIAA0305 protein 15.8 

114292 AI815395 Hs.184641 fatty add desaturase 2 1.9 

114309 AA332453 Hs.20824 CGI-85 protein 2.4 

1 14392 AA249590 Hs. 100748 ESTs, Weakly similar to A28996 proline-rich protein M14 precursor • mouse [M.musculus] 1.8 

50 114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKF2p434B0425 (from done DKFZp434B0425) 1.2 

1 14455 H37908 Hs.271616 ESTs. Weakly similar to ALUB_HUMAN ALU SUBFAMILY SX SEQUENCE 5.5 

114463 AL120247 Hs.40109 K1AA0872 protein 5.2 

114464 AI091713 Hs.1065g7 Homo sapiens. Similar to RIKEN cDNA 11 10012M11 gene, cfone 1MAGE:3688605. mRNA, partial cdsl^ 
114471 AA028074 Hs.l04613 RP42homotog 1.8 

55 114480 6E066778 Hs.151678 UDP-N-acetyt-aIpha>D-gaIadosamine:polypeptide N-acetylgaladosaminyltransferase 6 (GalNAcT^^ 13.4 

114671 AA766268 Hs.266273 hypotheUcal protein FU 13346 1.9 

114698 AA476966 Hs.1 10857 polymerase (RNA) IH (DNA directed) polypeptide K (12.3 kDa) 3.5 

114730 AI373544 Hs.331328 intennediate filament protein syncoiltn 3.8 

114767 AI859865 Hs.154443 minichromosome maintenance defident (S. cerevisiae) 4 1.6 

60 114774 AV656017 Hs.184325 CGI-76 protein 3.1 

114798 AA159181 Hs.54900 serologically defined colon cancer anf^en 1 3.5 

114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.3 

114895 AA236177 Hs.76591 KIAA0887 protein 7.1 

114896 BE539101 Hs.5324 hypothetical protein 1.3 

65 1 14911 AA236672 gb:zt29f02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:723771 3*. mRNA sequence. 1.5 

114930 AA237022 Hs.188717 ESTs 2.0 

114938 AA242834 Hs.58384. ESTs 2.9 
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' 114965 AI733881 Hs72472 BMP4^1B 2.3 

115023 AF102546 Hs.63931 dachshund (Drosophila) homolog 1.3 

115038 AA252360 Hs.87968 tolMike receptor 9 1.6 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 1913076 11.8 

S 115062 AA253314 Hs.154103 UM protein (simitar to rat protein kina 1.5 

115117 AI670847 Hs.5324 hypothetical protein 1.5 

115121 AI634549 Hs.88155 ESTs 2.8 

115206 AW1 83695 Hs.1 86572 ESTs 2.5 

115221 AW365434 Hs.79741 hypotheUcal protein FU 101 16 1.5 

10 115239 BE251328 Hs73291 hypotheUcal protein FU 10881 1.3 

115242 AI368236 Hs.283732 ESTs. Moderately similar to ALUI^HUMAN ALU SUBFAIVIILY J SEQUENCE 1.4 

115278 AK002163 Hs.301724 hypothetical protein FU1 1301 1.5 

115285 AW972872 Hs.293736 ESTs 2,4 

115291 BE545072 Hs.122579 hypothetical protein FU 10461 6.2 

15 115400 AI215069 Hs.89113 ESTs 6.6 

115468 AA314349 Hs.48499 tumor antigen SLP-8p 7.4 

115471 AK001376 Hs.59346 hypothetical protein FU 1051 4 1.4 

115479 AW301608 Hs.278188 ESTs, Moderately similar to 154374 gene NF2 pPDtein [H.sapiens] 4.0 

115496 AW247593 Hs71819 eukaryotic translation Initiation factor 4E binding protein 1 16.3 

20 115500 Y14443 Hs.88219 zinc finger protein 200 5.0 

115553 AJ275986 Hs71414 transcription factor (SMIF gene) 2.5 

115581 AI540842 Hs.61082 ESTs 6.1 

115587 6E081342 Hs283037 HSPC039 protein 2.9 

115590 AA399477 Hs.67896 7-60 protein 5.3 

25 115646 N36110 Hs.305971 solute carrier family 2 (fadlitated glucose transpbrter). member 10 4.7 

115652 BE093589 Hs.38178 hypothetical protein FU23468 10.6 

115655 AL048269 Hs.288544 Homo sapiens, clone MGCil 6063. mRNA. complete cds 127 

115663 AI138785 Hs.40507 ESTs 2.0 

115676 AA953006 Hs.88143 ESTs 3.0 

30 115690 AA625132 Hs.44159 hypothetical protein FU21 61 5 17 

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type receptor 3. flamingo (Drosophila) homolog 6.8 

115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit. beta type. 2 17 

•115734 AI950339 Hs.40782 ESTs 2.6 

115811 NM_015434Hs.48604 DKFZP434B1 68 protein 2.1 

35 115823 AI732742 Hs.87440 ESTs 2.1 

115837 AI675217 Hs.42761 ESTs 1.3 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 4.4 

115866 AW062629 Hs.52081' K1AA0867 protein 7.2 

115875 N55669 Hs,333823 mitochondrial ribosomal protein L13 1.2 

40 115941 AI867451 Hs,46679 hypothetical protein FU20739 5.5 

115958 AB037753 Hs.62767 KIAA1 332 protein 9.8 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 1.4 

1 1 601 1 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2A histone family, member L 1.8 

45 116134 BE243834 Hs.50441 CGI-04 protein 1.4 

116189 N35719 Hs.44749 ESTs. Moderately similar to T00358 hypothetical protein KIAA0884 [H.sapiens] '1.2 

116195 AWB21113 Hs.72402 ESTs 2.1 

116238 AV660717 Hs.47144 OKFZP586N0819 protein 17 

116246 AF265555 Hs.250646 baculoviral lAP repeat-<X)ntaining 6 17 

50 116262 A1936442 Hs.59838 hypothetical protein FU10808 17 

116298 AI955411 Hs.94109 Homo sapiens cDNAFU13634fis, clone PUCE101 1133 1.9 

116318 AF097645 Hs.58570 deleted in cancer 1:RNAhericaseHDB/D\CE1 4.9 

116325 AI472106 Hs.49303 Homo sapiens cDNAFLJ 11 663 fis. done HEMBA1 004631 1.4 

116336 AL133033 Hs.4084 KIAA1025 protein 1.9 

55 116339 AK000290 Hs.44033 dipeptidyl peptidase 8 1.5 

116350 AA497129 Hs.184771 nudear factor l/C (CCAAT-binding transcription factor)' 1.9 

116358 AI149586 Hs.38125 interferon-induced protein 75. 52kD 1.9 

116365 N50174 Hs.46765 ESTs 6.1 

116368 N90466 Hs.71109 KIAA1 229 protein 1.6 

60 116417 AW499664 Hs.12484 Human done 23826 mRNA sequence - • 7.4 

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1 

116452 AF218313 Hs.236828 putative heficase RUVBL 1.5 

116470 AI272141 Hs,83484 SRY (sex detennining region Y)-box 4 2.1 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2 

65 116575 AA312572 Hs.6241 phosphoinositide-3-kinase. regulatory subunit, polypeptide 1 (p85 alpha) 1.5 

116637 AKd01043 Hs.92033 integrin-linked kinase-assocfeted serine/threonine phosphatase 2C 2.7 

116640 X89984 Hs.211563 B-ceIICLL/lymphoma7A 2.3 
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116700 A1800202 Hs.317589 hypolheticai protein MGC1 0765 1.4 

116705 AW074819 Hs,12313 hypothetical protein FLJ 14566 3.4 

1 1 6732 AW152225 Hs.165909 ESTs, Weakly simaarto 138022 hypothetica! protein [H.saiMens) 2.9 

116921 AW068115 Hs.821 biglycan 8.3 

5 116926 H73608 Hs.290830 ESTs 1.7 

117034 U72209 Hs.180324 YY1-associated factor 2 3.4 . 

117132 AI393666 Hs.42315 plO-bmding protein 5,2 
1 17247 N21032 gb:yx46f06.s1 Soares melanocyte 2NbHM Homo sapiens cDNA done Iiy/1AGE:264803 3\ mRNA sequence. * 5.5 

117276 N71183 Hs.121 806 Homo sapiens cDNAFU11971 fis, done HEMBB1001208 1.5 
10 117284 AK001701 Hs.183779 Homo sapiens cDNA FU10590 fis. done NT2RP2004392. weakly similar to MNN4 PROTEIN 2.0 . 

117367 AI041793 Hs.42502 ESTs 2.0 

117368 AI878942 Hs.90336 ATPase, H+ transporting, lysosomal (vacuolar proton pump), member J 2.1 
117382 AF150275 Hs.40173 ESTs 2.7 
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic acid transporters), member 6 1.4 

15 117557 AF123050 Hs.44532 diubiquith 3,4 

117588 N34895 Hs.44648 ESTs 3.4 

117745 BE294925 Hs.46680 CGI-12 protein 3.0 

117754 AA121673 Hs.59757 zinc finger protein 281 1.9 

117879 N54706 Hs.303025 chromosome 1 1 open reading frame 24 1.8 

20 117881 AF161470 Hs.260622 butyrate-induced transcript 1 ^ 5,7 

117904 BE540675 Hs.332938 hypothetical protein IVIGC5370 ' 5.9 

117911 AL137379 Hs.47125 hypothetical protein FU 1391 2 .1.7 

117933 Y10518 Hs.116470 hypothetical protein FU20048 1.7 . 

117983 All 10246 ^Hs.47367 K1AA1785 protein 5,4 

25 - 118078 N54321 Hs.47790 EST 5.2 

118301 AA453902 Hs.293264 ESTs 2.6 

118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 2.5 

118472 Al_157545 Hs.42179 bromodomain and PHD finger containing, 3 4.1 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) 1-2 

30 118509 N22617 Hs.43228 Homo sapiens cDNAFU11835fis. done HEMBA1006595 1.5 

118528 AI949952 Hs.49397 ESTs . 7,4 

118656 AI458020 Hs.293287 ESTs 2.5 

118670 AA332845 Hs.152618 ESTs. Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 1,2 

118698 AB033113 Hs.50187 KIAA1 287 protein 2.1 

35 1 18737 AA199686 gb:2q75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA done iMAGE:647488 5' 5.2 

118925 N92293 Hs.206832 ESTs, Moderately simllarto ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4 
118984 AI668709 Hs.240722 ESTs. Moderately similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6 

118986 AF148713 Hs.1 25830 bladder cancer overexpressed protein 4.8 

119206 W24781 Hs.293798 KIAA1710 protein 17 

40 119235 AW453069 Hs,3657 activity-dependent neuroprotective protein 2.2 

119235 AW453069 Hs.3657 activity-dependent neuroprotective prote 1-6 

119265 BE539706 Hs.285363 ESTs ' 1.4 

119279 N57568 Hs.48028 EST 25.1 

119298 NM_001241Hs,1 55478 cydinT2 1-6 

45 119338 AI417240 Hs.320836 ESTs. Weakly isimilar to A47582 B-cell growth factor precursor [H.sapiensJ 1.3 

119349 T65004 Hs.1 83561 ESTs 8-4 

119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 6.7 

119478 AI624342 Hs.170042 ESTs 2.4 

119486 AI796730 Hs.55513 ESTs 2.1 

50 119513 W37933 Empirically selected from AFFX single probeset 1.9 

119601 AKD00155 Hs.91684 Homo sapiens mRNA; cDNA DKFZp5671103 (from ctone DKFZp667n03) 3.7 . 

119602 AW675298 Hs,233694 hypothetical protein FLJ1 1350 3.0 
119676 AA243837 Hs.57787 ESTs 1-4 
119682 W61019 Hs.57811 ESTs 1.2 

55 119774 AB032977 Hs.6298 KIAA1151 protein 1.B . 

119780 NM_016625Hs,191381 hypothetical protein 3.1 

119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2: stratum comeum tryptic enzyme) 9.2 
1 19805 AJ223810 Hs.43213 ESTs. Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN lEF SSP 3.6 . 

119818 AA130970 Hs.58382 hypomelical protein FU 11 101 2.5 

60 119863 AA081218 Hs.5B608 Homo sapiens cDNA FU14206 fis. done NT2RP30031 57 27 

1 19905 AW449064 Hs.1 19571 collagen, type 111. alpha 1 (Ehlers-Danlos syndrome type IV. autosomal dominant) 2.6 . 

119966 AA703129 Hs.58963 ESTs 2.7 

120132 W57554 Hs.125019 lymphoid nuclear profem (LAF-4) mRNA 1.2 

120206 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA 45.7 

65 120246 Ai924294 Hs.173259 uncharacterized bone marrow protein BM033 ' 1*2 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120269 AW131940 Hs.104030 ESTs 9.6 
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120274 AA177051 gb:nc02a02.s1 NCLCGAP^Pr3 Homo sapiens cDNA done 1MAGE:194 simflar to contains Alu. 4.6 

1 20280 AA1 90577 gb:2p52g02.s1 Stralagene HeLa cell s3 937216 Homo sapfens cONA done Z\ mRNA sequence 2,0 

120296 AW995911 Hs.299883 hypothetical protein FU23399 1.8 

120297 AA191384 Hs.104072 ESTs, Weakly s War to Z195_HUMAN ZINC FINGER PROTEIN 195 IKsaplensJ 15.2 
5 120324 AA195517 Hs.191643 ESTs 5.5 

120325 AA195651 Hs.104106 ESTs 6.4 

120327 AK000292 Hs.278732 hypothetical protein FU20285 16.1 

120336 N85785 Hs.181 165 eukaryotic translation elongation factor 1 alpha 1 2.9 

120342 AW450669 Hs.45068 hypothetical protein DKF2p434l1 43 5.7 

10 120345 AA210722 Hs.104158 ESTs 4.5 

120349 AW969481 Hs.55189 hypothetical protein 16.8 

120352 R06859 Hs.193172 ESTs. Weakly similar to 138022 hypothetical protein [H.sapiens] 5.0 

120356 AF000545 Hs.296433 putative purinergic receptor 28,1 

120371 AA219305 Hs.104196 EST 12.4 

15 120382 AA228026 Hs.38774 ESTs 4.0 

120383 AL109963 Hs.123122 FSH primary response (LRPR1. rat) homotog 1 9.7 

120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6 

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967g85 Hs.325572 ESTs. Moderately similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21.7 
20 120396 AA134006 Hs.79306 eukaryotic translation initiation factor 4E 12.5 

120404 AB023230 Hs.g6427 KIAA1013 protein . 7.2 

120418 AW966893 Hs.26813 Homo sapiens mRNA; cDNA DKFZp586F1323 (from done DKFZp586F1 323) 1 1 .4 

120423 AA236453 Hs.18978 Homo sapiens cDNA: FU22822 fis. done KAtA3968 1.9 

120472 AI950087 gbwqOScOZxl NCLCGAP_Kid12 Homo sapiens cONA done 3'. mRNA sequence 19.4 

25 120473 AA251973 Hs.269988 ESTs 5.4 

120484 AA253170 Hs.96473 EST 10.4 

120504 AA256837 gb:zr84d10.s1 Soares.NhHMPu.SI Homo sapiens cDNA done IMAGE:682387 3*. mRNA sequence. 3.9 

120509 BE047718 Hs.96545 ESTs 9.4 

120520 AA258601 Hs.161731 EST 2.4 

30 120535 BE350244 Hs.98547 ESTs 2.5 

120551 AA279160 Hs.1 11407 Homo sapiens, done IMAGE:361 3029. mRNA, partial cds 5.2 

120570 AA280579 Hs.271445 ESTs. Weakly slmiiar to ALUI.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4 

120582 BE244830 Hs.284228 ZNF135-like protein 10.2 

120590 AW372799 Hs.1 25790 leucine-rich repeat-containing 2 2.1 

35 120596 AA282074 Hs.237323 N-acetylgluoosamine-phosptete mutase 7.5 

120519 AW965339 Hs.111471 ESTs 2.5 

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0 

120639 AA286942 gb:zs56f05.s1 NCLCGAP_GCB1 Homo sapiens cDNA done IMAGE:701505 3' similar to contains Alu2.4 

120648 AA287095 Hs.140309 Homo sapiens, done IMAGE:3677194, mRNA, partial cds 5.0 

40 120553 AW063659 Hs.191649 ESTs 2.2 

120668 AW969638 Hs.1 12318 6.2 kd protein 2.2 

120669 BE536739 Hs.109909 ESTs 1.9 

120695 AA976503 gb:oq30a04.s1 NCLCGAP_GC4 Homo sapiens cDNA done 3* simflar to contains PTR7.t1 PTR7 46.8 

120696 AI821539 Hs.97249 ESTs 2.5 
45 120713 AW449855 Hs.96557 Homo sapiens cDNAFU12727fiS. done NT2RP2000027 5.9 

120718 AA292747 Hs.97295 ESTs 2.9 

120750 AI191410 Hs.96693 ESTs. Moderately similar to 21 09260ABceU growth factor [H.saplens] 7.0 

120774 AI608909 Hs.193985 ESTs 7.8 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1 848 protein 6,8 

50 120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cONA 3* end similar to EST containing O family repeat. 4.4 

120938 AA386260 Hs.104632 EST 4.4 

120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 AI219896 Hs.97592 ESTs 1.2 
55 121011 AA398360 Hs.97608 EST 3.1 

121026 A1439713 Hs.165295 ESTs 3.5 

121081 AA398721 Hs.186749 ESTs, Highly similar to 137550 mismatch repair protein MSH2 [H^apiensJ 5.4 

121133 AA363307 Hs.97032 ESTs 3.7 

121176 AL121523 Hs.97774 ESTs 1.7 

60 121223 AI002110 Hs.97169^ ESTs. Weakly similar to dJ667H12.2.1 [H.sapiens] 2.9 

121320 AA403008 Hs.30l927 c6.1A 1.9 

121340 AW956981 Hs.97910 Homo sapiens CDNAFU13383 fis. done PLACE1G01024 3.5 

121408 AA406137 Hs.98019 EST 6.0 

121439 AA410190 Hs.98076 ESTs. Weakly similar to A47582 B-cell growth factor precursor (H.sap!ensl 7.4 

65 121450 AA406430 Hs.105362 Homo sapiens, done MGC:18257. mRNA, complete cds 6.9 

121452 AW971063 Hs,292882 ESTs 1.8 

121455 H56306 Hs.15165 retinoic.add induced 14 10.5 
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121457 W07404 Hs.1 44502 hypothetical protein FU22055 3.4 

121496 AA442224 Hs.97900 ESTs 14.4 

121505 AA494172 Hs.194417 ESTs 13.1 

121508 AA402515 Hs.97887 ESTs 28.0 

5 121513 AA416653 Hs.181510 ESTs 6.2 

121514 AA412112 gb:zt69b02.s1 SoaresJestis^NHT Homo sapiens cONA done IMAGE:727563 3\ mRNA sequence. 2.6 

121549 AA412477 Hs.98142 EST • 7.4 

121558 AA412497 gb:zt95g12.s1 SoaresJestis.NHT Homo sapiens cCNA clone iMAGE:7301 50 3* similar to cdntains2.8 

121577 AA411970 Hs.98096 EST 3.5 

10 ' 121581 AA416568 gb:2u05c10.s1 Scares Jestis_NHT Homo sapiens cDNA done S", mRNA sequence 6.1 

121589 AD001528 Hs.89718 spermine synthase 3.9 

121594 AA626010 Hs.98247 ESTs 2.2 

121622 AA416931 Hs.126065 ESTs 4.2 

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNADKFZp434B1023 (from done DKFZp434B1 023) 7.8 

1 5 121682 AA418160 Hs.86043 Homo sapiens cOm FLJ13558 fis. done PLACE1007743 2.0 

121690 AV660305 Hs.1 10286 ESTs 47 

121706 U55184 Hs.154145 hypothetical protein FU 11585 127 
121714 AA419225 Hs.98269 Homo sapiens cDNA FU1 1953 fis. done HEJWBB 1000883 ' 8.1 

121729 AI949597 Hs.98325 ESTs , 1.8 

20 121731 AA421041 Hs.180744 ESTs 4.0 

121744 AA398784 Hs.97514 ESTs 7.1 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 19.5 

121773 AB033022 Hs.158654 K1AA1 196 protein 7.9 

121775 AA421773 Hs.161008 ESTs 1-7 

25 121776 AA292579 Hs.125133 hypothetical protein FU22501 6.6 

121786 AI810774 Hs.98376 ESTs 10-5 

121832 AW340797 Hs.98434 ESTs 5.8 

121836 AA328348 Hs.218289 ESTs 3.8 

121839 AA425691 Hs.191606 ESTs. Highly similar to KIAAI 048 protein [H.sapiensl 5.0 

30 121842 AF027406 Hs.104865 serine/mfeonine kinase 23 2.7 

121847 AA446628 Hs.2799 cartilage linking protein 1 2.3 

121871 AW972668 Hs.293044 ESTs 2.9 

121882 AA426376 Hs.98459 ESTs 5.0 
121911 AA427950 gb:zw50f02.s1 Soares_tDtalJetus,Nb2HF8„9w Homo sapiens cDNA clone IMAGE:773499 3' 7.2 

35 121915 AA428179 Hs.223405 ESTs, Moderately similar to A46010X-linkBd retinopathy protein [H-sapiens] 2.5 

121935 AA428647 Hs.98611 EST 2.3 

121983 AA298760 Hs.180191 hypothetical pfotehi FLJ1 4904 3.4 

121985 AI862570 Ks.299214 Homo sapiens. dortelMAGE:2822295. mRNA, partial cds 114 

121995 AA210863 Hs.3532 nemo^ke kinase 3.8 

40 121999 AA430211 Hs.98668 EST 6.4 

122009 AW292763 Hs.160822 Homo sapiens cDNA: FU20863fis. done ADKA01 804 2.2 

122013 AA431085 Hs.98706 ESTs 6.5 
122036 W92142 Hs.271963 ESTs, Weakly simflar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1 

122050 AI453076 Hs.l66109 ELAV (embryonic lethal, abnomial vision. Drosophila)-like 2 9.1 

45 122060 AA431738 Hs.98750 EST 13.1 

122114 AW161023 Hs.104921 ESTs 1.5 

122188 AA398838 gb:zt80d01.rl Soares_testis_NHT Homo sapiens cDNA done S*. mRNA sequence 3.3 

122204 AA435936 Hs.98842 EST 5.6 

122246 AA329550 Hs.29417 HCF-bindmgtranscriptfon factor Zhangfei 5.1 

50 122257 AA438819 Hs.98899 ESTs 5.6 

122302 AA441801 Hs.104947 ESTs 5.8 

122341 AW601969 Hs.99010 hypothetical protein FU22263 similar to nudear receptor-binding SET-domain protein 1 . 2.0 

122356 AA443794 Hs.98390 ESTs 7.3 

122369 AA443985 Hs.303222 ESTs 12.2 

55 122371 AA868555 Hs.178222 ESTs . 5.0 

122372 AA446008 Hs.336677 EST 7.6 

122378 AB032948. Hs.21356 hypothetical protein DKF2p762K2015 2.5 

122405 AA446572 Hs.303223 EST 2.8 

122412 AA446869 Hs.l 19316 ESTs 7.3 
60 122415 AA446918 Hs.99088 EST 1.9 

122418 AA446966 Hs.99090 ESTs. Modeiately similar to similar to KIAA0766 [H.sapiens] 6.8 

122440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from done DKFZp547C244) 2.6 
122446 AA447603 Hs,99123 EST 1.8 

122448 AA447626 Hs.99127 EST 3.5 
65 122458 AI266159 Hs.104980 ESTs 1.5 

122460 AW418788 Hs.99148 ESTs. WeaWy similar to S43569 R01H10,6 protein - Caenorhabditis elegans [Celegans] 9.7 

122464 AA448158 Hs.99152 EST 4.8 
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122490 AA448349 Hs.238151 EST 6.1 

122492 AA448417 Hs,104990 ESTs 5.4 

122502 AA204969 Hs.234863 Homo sapiens cDNAFU 12082 fts, done HEMBB 1002492 1.3 

122510 AA449232 Hs.99195 ESTs 11.2 

5 122530 AW959741 Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 10.1 

122547 AA779725 Hs.164589 ESTs 2.5 • 

122555 AA194055 Hs.293858 ESTs 1.9 

122570 AA452578 Hs.262907 ESTs 9.5 

122572 AA452601 Hs.99287 EST 11.0 

10 122586 AK001910 Hs.99303 Homo sapiens cDNAFU11048fis. done PUCE1004516 3.4 

122587 AB040893 Hs.6968 KIAA1480 protein 2.0 

122598 AI028173 Hs.99329 ESTs 1.7 

122599 AL355841 Hs.99330 hypothetical protein FU23588 4.4 
122602 'AA411925 Hs.301960 ESTs 4.6 

15 122607 AA453518 Hs.98023 ESTs 61.5 .. 

122614 AA453630 Hs.99339 EST 10.7 

122616 AA453638 Hs.161873 ESTs 107.3 

122617 AI681535 Hs.148135 serine/threonine kinase 33 121.4 

122618 AA453641 gb:zx48e06.s1 Soares testis NHT Homo sapiens cDNA done 3'. mRNA sequence 31.1 
20 122622 AA453987 Hs.144802 ESTs 5.6 

122717 AA456859 Hs.178358 ESTs 8.5 

122762 AI376875 Hs.105119 ESTs 10.4 

122829 AW204530 Hs.99500 ESTs 81.8 

122834 AA461492 Hs,99545 Homo sapiens cDNA FU 10658 fis, done NT2RP2006052 3.6 

25 122836 AA460581 Hs.290998 ESTs 4.5 

122837 AA461509 Hs.293565 ESTs, Weakly similar to putative p 150 [H. sapiens] 27 

122838 AA460584 Hs.334386 ESTs 75.3 
122854 AA600235 Hs.9625 NIMA (never in mitosis gene a)-related kinase 6 7.7 
122856 AI929374 HsJ5367 Src-like-adapter 5.8 

30 122861 AA335721 Hs.1 19394 ESTs 1.3 

122866 BE539656 Hs.283705 ESTs . 4.1 

122868 AF005216 Hs. 11 5541 Janus kinase 2 (a protein tyrosine kinase) 5.3 

122870 AW576312 Hs.318722 Homo sapiens cDNA: FU21766 fis, done COLF7179 9.9 

122872 AW081394 Hs.97103 ESTs 5.3 

35 122879 AA769410 Hs.128654 ESTs 13.9 

122907 AA470074 Hs.169896 ESTs 11.5 

122916 AA470140 Hs.229170 EST 17 

122981 AA478951 Hs.105629 ESTs 5.0 

123013 AW968324 Hs.17384 ESTs 15.4 

40 . 123016 AW338067 Hs.323231 Homo sapiens cDNAFUl 1946 fis. done HEI\/1BB1000709 2.8 

123034 AL359571 Hs.44054 ninein (GSK3B interacting protein) 87 

123072 AI382600 Hs.1043O8 ESTs. Weakly similarto KIAAI 395 protein [H.sapiens] 8.8 

123082 AA485360 Hs.105661 ESTs 3.9 

123088 AI343652 Hs.105667 ESTs 3.8 

45 123110 AA486256 Hs.193510 EST 7.4 

123114 BE304942 Hs.265848 myomegaBn 2.8 

123131 T52027 Hs.271795 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 2.4 

123132 AI061582 Hs.324179 Homo sapiens cDNA FU12371 fis. done MAIWMA1002434 15.6 
123136 AW451999 Hs.194024 ESTs 5.1 

50 123149 A1734179 Hs.105676 ESTs 23.8 

123152 AW601773 Hs.270259 ESTs 5.2 
123258 AA490929 Hs.105274 ESTs, Weakly similar to RMS1 HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens] 9.3 
123315 AA496369 gb:zv37d10.s1 Scares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:755827 3* similar to4.1 

123369 AA504757 Hs.105738 ESTs 6.9 

55 123394 AA731404 Hs.105510 ESTs 3.6 

123433 AW450922 Hs.1 12478 ESTs 37 

123466 AA599042 Hs.1 12503 EST 7.4 

123470 AW303285 Hs.303632 Human DMA sequence from done RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5 

123471 AB021644 Hs.197219 zinc finger protein 14 (KOX 6) 5.2 
60 123475 BE439553 Hs.250528 Homo sapiens, done IIWIAGE:4098694, mRNA, partial cds 17 

123482 N95059 Hs.55098 ESTs 1.6 

123486 BE019072 Hs.334802 Homo sapiens cDNA FU14680 fis, done NT2RP2004242. weakly similar to 2.4 

123508 AW380388 Hs.155546 KIAA1080 protein; Golgi-assodated, gamma-adaptin ear containing. ARF-binding protein 2 2.2 

123615 AA609170 gbaf12a12.s1 Scares Jestis_NHT Homo sapiens cDNA done 3', mRNA sequence 7.8 

65 123619 AA602964 gb:no97c02.s1 NCLCGAP_Pr2 Homo sapiens cDNA done. mRNA sequence 2.8 

123658 AA609364 gb:zu71d09.s1 Scares Jesfe_NHT Homo sapiens cONA done IMAGE:743441 3* simiiar to conta&« Alu. 1 .7 

123674 AI26g609 Hs.105187 kmesin protein 9 gene 5.7 
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123735 NM_013241Hs.95231 FH1/FH2 domain-containing protein 10.0 

123738 AA609891 Hs.112777 EST 5.2 

123753 AA609955 Hs.234961 Huntingtin interacSng protein E 30.6 

123804 AA620464 Hs.251915 EST, Wealdy similar to S65657 alpha-IC-adrenergic receptor splice form 2 [H.sapiens) 2.1 
5 123811 AA620586 gbae60g05.s1 Stratagene lung carchoma 937218 Honw sapiens cONA done IMAGE:951 320 3" 2.7 

123951 AB012922 Hs.173043 metasfasis^ssociated 1-tike 1 6.2 

123983 AJ272267 Hs.146178 choline dehydrogenase 4.4 

124001 L42542 Hs75447 ralA binding protein 1 7.0 

124006 AI147155 Hs.270016 ESTs 8.^ 

10 124070 AI950314 Hs.154762 HlV-1 rev binding protein 2 3.7 

124074 H05635 Hs.294030 topoisomerase-ielated function protein 4-2 1.2 

124178 BE463721 Hs.g7101 putative G protein-coupled receptor 3.1 

124203 AA372796 Hs.269339 ESTs. Weakly similar to AF161 356 1 HSPC093 (H.sapiens] 5.7 

124352 AA640891 Hs.102406 ESTs 31 

15 124375 D87454 Hs.192966 KIAA0265 protein 3.5 

124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cONA done similar to contains 57.1 

124390 AA317338 Hs7535 COBW-tike protein 2.8 

124391 AF155099 Hs.279780 NY-REN-1 8 antigen 7.1 
124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 3' similar to contains A!u 3.3 

20 124428 H13540 Hs.82202 ribosomal protein LI 7 . 2.9 
124440 AA532519 Hs.129043 Human DNA sequence from done 989H1 1 on chromosome 22q1 3.1-13.2. Contains part of a 7.8 

124466 R10084 Hs.1 13319 kinesin heavy chain member 2 2.6 
1 24482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDN A done 3'. mRNA sequence 7.9 

124498 H79433 Hs.268997 ESTs 7.8 

25 124515 AA669097 Hs.109370 ESTs 3.3 

124608 N71076 Hs.102800 ESTs. Weakly similarto neuronal thread protein AD7C-NTP [H.sapiensl 4.5 

124631 NIVL014053HS.270594 FLVCR protein 3.2 

124634 AI765123 Hs.143671 Homo sapiens cDNA FLJ13533 fe. done PUCE1006371 5.8 

124637 AA1 60474 Hs.75798 hypothetical protein 9.3 

30 124642 AW968856 Hs.278569 sorting nexin 17 3.5 

124649 N92593 Hs.3l3054 ESTs 6.1 

124656 AW297702 Hs.102915 ESTs 8.3 

124661 R48170 Hs.78436 EphBI 5.6 
124683 AA381661 Hs.1 19878 ESTs. Weakly similar to M3K9_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE 7,9 

35 124712 R09166 Hs.191148 ESTs 5.7 

124735 R22952 Hs.268685 ESTs 113 

124761 AA374756 Hs.93560 Homo sapiens mRNA lor KIAA1771 protein, partial cds 9.0 

124768 AW368528 Hs.100855 ESTs 8.1 

124775 R41772 Hs.100878 ESTs 4.9 

40 124777 R41933 Hs.140237 ESTs. Weakly similar to ALU1_HUMAN ALU SUBFAI^ILY J SEQUENCE 2.8 

124788 R43543 Hs.100912 Homo sapiens cDNA: FU22726ris. done HSl 15005 5.1 

124809 AL355722 Hs.106875 Homo sapiens EST from done 35214. full insert 4.2 

124811 R46058 Hs.288912 hypothetical protein FU22604 145 

124812 R47948 Hs.188732 ESTs 7.9 
45 124822 AA418160 Hs.86043 Homo sajMenscDNA FIJI 3558 fis, done PLACE 1007743 6.6 

124825 AA501669 Hs.336693 ESTs 2.3 

124833 AW975868 Hs.294100 ESTs 27 

124857 R63652 Hs,137190 ESTs 2.3 

124860 R65763 Hs.101477 EST 23.9 

50. 124863 A1382555 Hs.1 27950 bromodomain-containirig 1 2.0 

124876 AF135422 Hs.27059 GDP-mannose pyrophosphorylase A 4.4 
124878 BE397530 Hs.288057 hypothetical protein FLJ22242 27 

124902 H37941 Hs.101883 ESTs ' 57 

124903 AW296713 Hs.221441 ESTs 32.4 
55 124930 A1076343 Hs.173939 ESTs. Weakly similar to ALUB.HUMAN 111! ALU CLASS B WARNING ENTRY III [H.sapiens) 22.8 

"124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens) 6.1 
124958 AI078645 Hs.431 murine leukemia viral (bmM) oncogene homolog 1.9 
124980 T40841 Hs.986B1 ESTs 4.5 
125002 T59338 Hs.269463 ESTs. Weakly simflar to ALU1.HUMAN ALU SUBFAMILY J SECXJENCE CONTAMINATION 4.9 

60 125047 T79815 Hs279793 ESTs 5.0 

125051 T79956 Hs.100588 EST 135.3 ' 

125056 T81310 Hs.100592 ESTs 5.4 
125101 AI472068 Hs.266236 KIAA1 856 protein 5.6 
125113 T96595 Hs.302270 ESTs.Weaklysin[iilartoALUF„HUMANni! ALU CUSS F WARNING ENTRY III [H.sapiens] 18 

65 125115 T97341 gb7e57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:121 856 3' similarto 9.6 

125125 A1222382 Hs.240767 Human DNA sequence from done RP1-12G14 on chromosome 6q24.1-25.2, Contains the 5" end of the gene .1.5 
125147 W38150 Empirically selected from AFFX single probeset 1.7 
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125161 W44657 Hs.144232 EST 107 

125249 AA630863 Hs.131375 ESTs, Moderately simnar to ALUB.HUMAN III! ALU CLASS B WARhrtNG ENTR U 

125255 AF0g8162 Hs.118631 timeless (Drosophila) homolog 9.4 

125279 AW401809 Hs.4779 KIAA1150 protein 1.5 

5 125280 A!123705 Hs.106932 ESTs 8.0 

125298 AW972542 Hs.289008 Homo sapiens cDNA: FU21814 fis. done HEP01068 1.5 

125660 AW292171 Hs.2397a scaffold attachment factor B 5,9 

125827 NM„003403Hs.97496 YY1 transcription factor 12 

125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 6.4 

10 126005 AW409701 Hs.1578 baculoviral lAP repeat-containing 5 (sun/ivin) 14.3 

126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 2.4 

126595 AA643322 Hs.172028 a disintegrin and metalloproteinase domain 10 9.1 

127050 AW411066 Hs,274351 CGi-89 protein 17.0 

127274 AW965158 Hs.58582 Homo sapiens CDNAFLJ12789 fis. done NT2RP2001947 12.8 

15 128355 AW293012 Hs.161623 ESTs 7.3 

128493 D87466 Hs.240112 K1AA0276 protein 3.1 

128493 D87465 Hs.240112 KIAA0276 protein 1.3 

128522 BE173977 Hs.10098 putative nudeotar RNA helicase 9.4 

128527 AA504583 Hs,101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors El 2/E47) 1.5 

20 128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR1 4 [H.sapiensJ 2.8 

128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member 12.1 

128599 NWL015366HS.102336 Rho GTPase activating protein 8 23 

128604 At879099 Hs.102397 GIOT-3 for gonadotropin ffidudbte transcription repressor-3 13 

128608 BE267994 Hs.102419 zinc finger protein 7.1 

25 128625 AB037841 Hs.102652 hypothetical protein ASH1 1.3 

128629 AL096748 Hs.102708 DKFZP434A043 protein 3.2 

128639 AW582962 Hs.102897 CGI-47 protein ^ 2.0 

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon 1.4 

128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilo 1-3 

30 1 28658 BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis {SaccharomycesH«ke 2 2.4 

128670 AA975486 Hs.1 03441 Homo sapiens. Similar to RIKENcDNA1700010L19 gene, done MGC:16214.mRNA. complete cds 7.1 

128691 W27939 Hs.l03834 hypothetical protein MGC5576 '7.7 

128696 BE081143 Hs.225977 nuclear receptor coactivator 3 3.8 

128700.^ Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 11 1.6 

35 128714 T85231 Hs.179661 tubulin, beta 5 7.6 

128717 AK001564 Hs.104222 hypothetical protein FU 10702 5.5 

128733 BE147740 Hs.104558 ESTs. Moderately similar to 138022 hypothetical protein [H.sapiens] 2.7 

128737 AF292100 Hs.104613 RP42homolog 2.8 

128742 AA307211 Hs.251531 proleasome (prosome, macropain) subunit alpha type. 4 4.4 

40 128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD) 2.2 

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase 2.8 

128772 BE302796 Hs.l05097 thymidine kinase 1. soluble 5.3 

128781 N71826 Hs. 105465 small nuclear ribonudeoprotein polypeptide F 53.9 

128797 NM_002975Hs.105927 stem cell growth factor; lymphocyte secreted C-type lectin 13.3 

.45 128806 AW630942 Hs.106061 RDRNA-blnding protein 2.6 

128814 AW248431 Hs.256526 nudear prelamin A recognition factor 2.2 

128830 BE2&1170 Hs.106357 valosin-containing protein 5.9 

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 1.6 

128854 BE159181 Hs.168232 hypothetical protein FU13855 2.2 

50 128854 BE159181 Hs.168232 hypothetical protein FU13855 1 9 

128868 AA419008 Hs. 106730 chromosome 22 openieadtng frame 3 3.0 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 2.2 

128871 AF189723 Hs.106778 ATPase, Ca-M- transporting, type 2C, member 1 1.5 

128891 F34856 Hs.292457 Homo sapiens, done MGC:16362. mRNA. complete cds 13.3 

55 128905 R57988 Hs.10706 epithelial protein lost in neoplasm beta 4.7 

128920 AA622037 Hs.166468 programmed cell death iS 1-4 

128925 R67419 Hs.21851 Homo sapiens CDNAFU12900 fis. done NT2RP2004321 1.9 

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3-hydroxylase) 7.2 

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase domain 12 {mettrin alpha) (ADAM-1 2) 2.4 

60 128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 1-3 

128959 AI580127 Hs.107381 hypothetical protein FU 11 200 1P-9 

128965 AW150697 Hs.107418 ESTs 1.4 

128970 AI375672 Hs.165028 ESTs 1.3 

128975 BE550779 Hs.284233 NICE-5 protein 14.0 

65 128979 AW271217 Hs.281434 Homo sapiens cDNAFU14028fe. done HEMBAlb03838 1.6 

128995 AI816224 Hs.107747 DKFZP566C243 protein 1-9 

129019 AI950087 gbwq05c02,x1 NCI CGAP Kld12 Homo sapiens cONA done 3', mRNA sequence 2.9 
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129021 AL044675 Hs.173081 KIAA0530 protein 3-8 

129021 AIJ044675 Hs.173081 KIAA0530 protein 2.5 

129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 3-4 

129076 AW296806 Hs.326234 ESTs. Highly similar to T48422 hypothetical protein DKFZp434M2023.1 [H.^^^^ 5.0 

5 129078 AI351010 Hs.102267 lysosomal 2.1 

129088 AA744610 Hs.194431 palladin 17.1 

129095 L12350 Hs.108623 thrombospondin 2 2^7 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 

129097 BE243933 Hs.108642 anc finger protein 22 (KOX 15) 3.0 
1 0 129099 AF146074 Hs.108660 ATP-binding cassette, subfamily C (CFTR/MRP). members 5.8 

129136 W93048 Hs.250723 hypothetical protein MGC2747 5.9 

129149 AA356620 Hs.108947 KIAA0050 gene product ^-3 

129172 AW162916 Hs.241576 hypothetical protein PR02577 l'^ 

129192 AA286914 Hs.183299 ESTs 2.1 

15 129194 AA150797 Hs.109276 latexln protein 3.2 

129198 N57532 Hs,109315 KIAA1415 protein 5.8 

129207 AI934365 Hs,109439 osteoglycin (osteoinductive factor, mimecan) 8 0 

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 2.9 

129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 3.2 
20 129254 AA252468 HS.109B DKFZp434J 181 3 protein 2.6 

129255 A1961727 Hs.109804 HI histone family, member X ^-3 

129288 W26392 Hs.1 10080 ESTs. Weakly similar to S13495 pregnancy zone protein IH.sapiens] 9.6 

129296 AI051957 Hs.1 10122 ESTs ^-2 

129323 AA287239 Hs.5518 Homo sapiens cDNAFU 11 31 1 fis, done PLACE 101 01 02 5.1 

25 129340 H75334 Hs.1 1050 F-box oniy protein 9 ^-^ 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 7.6 

129362 U30246 Hs.1 10736 solute carrier family 12 (sodium/potassium/chloride transporters), member 2 6.7 

129366 BE220805 Hs.1 84697 Homo sapiens clone 23785 mRNA sequence 8-6 

129370 AI686379 Hs.110796 SARI protein J-^' 

30 129372 NM_016039Hs.1 10803 CGI-99 protein 2,0 

129403 AF149785 Hs,1 11 126 pituitary tumor-transforming 1 interacting protein 7.4 

129404 A1267700 Hs.317584 ESTs 5.0 
129404 AI267700 Hs.317584 ESTs 

129423 AA204686 Hs.234149 hypothetical protein FU20647 

35 129449 AI096988 Hs.111554 ADP-ribosylation factor-fike 7 Vo 

129453 AW974265 Hs.1 11632 Lsm3 protein 3.2 

129482 AA188185 Hs.289043 sprndlin ^ J 

129482 AA188185 Hs.289043 spindlin 3.6 

129513 AW843633 Hs.306163 hypothetical prolenAL1 101 15 J-l 

40 129515 AF255303 Hs.1i2227 membrane-associated nudeic acid binding protein 2.5 

129527 AA769221 Hs.270847 delta-tiibulin 3.2 

129559 W01296 Hs.11360 hypothetteai proteb FU 14784 ^-5 

129560 AA317841 Hs7845 hypotiietical protein MGC2752 5,8 
129570 AI923097 Hs.1 1441 chromosome 1 open reading frame 8 2,0 

45 129575 F08282 Hs.278428 progestin induced protein 1-5 

129587 H14718 Hs.1 1506 Human done 23589 mRNA sequence 5.8 

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-like 9 ^-^ 
129591 N57423 Hs.179898 HSPC055 protein I'l 
129594 AW403724 Hs.36989 coagulation fador VII (serum prothrombin conversion accelerator) 9,0 

50 129596 AF035537 Hs.115521 REV3 (yeast homologj-like. catalytic subunit of DNA polymerasie zeta 1.6 

129628 U38945 Hs.1174 cyclin-dependent kirwse inhibitor 2A (melanoma, pi 6, inhibits CDK4) 2,2 

129628 U38945 Hs.1174 cydin-dependent kinase inhibitor 2A (me 1-4 

129629 AK000398 Hs.1 1747 hypothetical protein FU20391 3.8 
129649 AD000092 Hs.l6488 calreticulin - 3.3 

55 129675 NM.015556HS.172180 KIAA0440 protein ]3.4 

129680 U03749 gb:Human chromogranln A (CHGA) gene, promoter an • 14.1 

129689 AW748482 Hs.77873 B7homo!og3 2.6 
129702 AI304966 Hs.12035 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 7.4 
129720 AA156214 Hs.12152 APIVICF1 protein 2.0 

60 129721 NM_001415Hs.211539 eukaryotictranslation initiation factor2,subunit3 (gamma, 52kD) 1.7 
129726 H15474 Hs.132898 fatty add desaturase 1 8-3 

129778 AK001676 Hs,12457 hypothetical protein FU10814 ^ -^ 

129779 AA394090 Hs.12460 Homo sapiens done 23870 mRNA sequence 5.4 
129800 AF052112 Hs.12540 lysosomal ] l 

65 129806 AB023148 Hs.173373 KIAA0931 protein 1*2 
129815 BE565817 Hs.26498 hypothetical protein FU21 657 3.1 
129840 NM_006590Hs.12820 SnRNP assembly defective 1 homotog 1*8 
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129861 


AL049999 


Hs.85963 


DKFZP564M182 protein 


22 


129864 


A1393237 


Hs.129914 


runt-related transcription factor 1 (acute myeloid (eukemia 1; amll oncogene) 


1.7 


129869 


AI222069 


Hs.13015 


hypotheticat protein simitar to mouse Onajll 


2.7 


129922 


AF042379 


Hs,13386 


gamma-tubultn raniptex protein 2 


4.5 


129945 


BE514376 


Hs.1 65998 


PAkI mRNA-bindIng protein 


1,8 


129953 


AA412195 


Hs.13740 


ESTs 


2.5 


129972 


AW753185 


Hs.180628 


dynamin 1-Iike 


1.8 


129983 


U09848 


Hs.132390 


zinc finger protein 36 (KOX 18) 


1.3 


129989 


AB015856 


Hs.247433 


activating transcription factor 6 


4.0 


130010 


AA301116 


Hs.142838 


nudeolar phosphoprotein Nopp34 


1.6 


130081 


AA287325 


Hs.14713 


ESTs 


4.0 


130082 


S73265 


Hs.1473 


gastrin-releasing peptide 


1.8 


130097 


AL046962 


Hs.14845 


fori<head box 03A 


2.8 


130100 


ALl 35561 


Hs.14891 


hypothetical protein FLJ21047 


2.3 


130111 


X53002 


Hs.149846 


intPQiin beta 5 


2.3 


130112 


AA916785 


Hs.180610 


splicing factor proiine/gtutamine rich (poiypyrimidine tract-binding protein-associated} 


10 


130112 


AA916785 


Hs.180610 


splicing factor proline/glutamine rich ( 


2.1 


130128 


L76937 


Hs.150477 


Werner syndrome 


1.8 


130135 


AA311426 


Hs.21635 


tubulin, gamma 1 


6.1 


130211 


N[VL003358Hs.23703 


ESTs Moderateiv similar to CEGT HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H saoie 


ns] 1.6 


130212 


D80001 


Hs.152629 


KIAAQ17g nrotein 


1.3 


130236 


R85367 


Hs.51957 


splicing factor, afginine/serine-rich 2, interacting protein 


2.0 


130241 


AL035588 


Hs.153203 


MvoD famflv inhibitor 


3.2 


130242 


)C79201 


Hs.153221 


svnnvial safncima tranfilrknatpri tn X rhmmfvfnmp 


5.4 


130249 


D81983 


Hs.322852 




4.8 


130263 


NMJ02497HS.153704 


Miivm ^iicvci III iitiLUOia yciic aj-iciauni iMllooc £. . 


1.4 


130287 


AA479005 


Hs,154036 


tumor suoDrp^Sna suhtran^fiprahlp ranrilriatp 3 


2.6 


130310 


AB011121 


Hs.154248 


amvotronhic lateral sdencmis 2 fiuvenile) chromosomp feaion candidatB'3 


6.3 


130353 


Z19084 


Hs.172210 


MIIF1 nrotpin 


6.2 




API 27577 


Hs.155017 






130357 


AJ224442 


Hs.155020 


niifatiwa mpihx/ttrariQfprAco 

|JUIuUVt7 II ICU Ij IIICII l3ICladC 




13n3SQ 


NM_013449Hs.277401 
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130367 

1 WWWWf 


AL135301 


Hs.8768 


hvnnthpHrj^l nrntpin Fl MOAdQ 


1.4 


130372 


Ald77464 


Hs.5011 


RMA hinrlinn mntif nmfpin Q 


3.3 


130393 


N89487 


Hs.155291 


KIAA0005 gene product 


1.8 


130399 


AW374106 Hs.155356 


hvnnihptinal nrntpin Mf^r*9ft4n similar tn a ntitafivp ntiiiYLCvltran^fpra^P 


3.4 


130407 


BE385099 


Hs.334727 


hvnothpfical nrnfpin M(nP3ni7 


2.3 


130409 


NM_001197Hs.l55419 


ROI 9-intprartinn ktllpr /annntn^i«!-inrtiir}nn\ 


2.7 


130419 


AF037448 


Hs.l 55489 




1.8 


130441 


U53630 


Hs. 155637 


Drotein kinase DNA-activatpd catalvtic nolvDenftdp 


2.3 


130448 


BE513202 


Hs. 15589 


PPAR bindino nrotein 


3.9 


130455 


D90041 


Hs.155956 


M-af%tvltran5«fprasp 1 fan/laminp N-Arpt\/Urr)n^fpm<p^ 


33.6 


130455 


D90041 


Hs.155956 


N-acetyltransferase 1 (arylamine N-ace^ 


4.6 


130471 


ALl 21438 


Hs.183706 


addudn 1 (alpha) 


2.7 


130485 


BE245851 


Hs.180779 


H2B histonfi familv meinhpr R 

1 Ir 1 f iit^iwilw lOilili^i il191IIUV7l 0 


5.0 


130487 


U49644 


Hs.77613 


ataxia telangiectas^ and Rad3 related 


4.3 


130498 


L38951 


Hs.180446 


IcafvoDherin nmoortin) hpia 1 

iwMl TV|/llwl If 1 IIIIIM^IU'IJ UCIO 1 


1.6 


130503 


BE208491 


Hs.295112 


KiAA0618 gene product 


16.1 


130511 


L32137 


Hs.1584 


cartllaoe oliaameric matrix orotein fDseudoachQndiDDlasia enlt^vseal dysplasia 1. muifinle) 


6.1 


130511 


L32137 


Hs.1584 


cartiiaoe oliaomeric matrix omtetn fnse 


5.3 




AW876523 


Hs.15929 


hVDOfhetical nratpin FL J19Qin 

1 1 J pwu i^ut^ai ^luiciii i imAi 1^57 lu 


2.1 


130542 


U64675 


Hs.179825 


RAN binding protein 2-like 1 


7.8 


130544 


AA321238 


Hs.4310 


eulcaryotic translation initiation factor 1A 


1.5 


130553 


AF062649 


Hs.252587 


pituitary tumor-transforming 1 


14.4 


130556 


AI907018 


Hs.15977 


Empirically selected from AFFX single probeset 


4.7 


130567 


AA383092 


Hs.1608 


replication protein A3 (14kD) 


7,9 


130568 


AA232119 


Hs.16085 


putative G-protein coupled receptor 


33 


130574 


AF083208 


Hs.16178 


apoptosis antagonizing transcription factor 


1.2 


130586 


AB007891 


Hs,16349 


KIAA0431 protein 


5.6 


130598 


AL042210 


Hs.16493 


hypothetical protein DKFZp762N2316; KIAA1603 protein 


1.4 


130601 


AA609738 


Hs.16525 


ESTs 


1.5 


130614 


AI354355 


Hs.16697 


down-regulator of transcription 1 , TBP-binding (negative cofactor 2) 


1.3 


130617 


M90516 


Hs.1674 


gtutamin&-fmctose-6-phosphate transaminase 1 


12.1 


130617 


M90516 


Hs.1674 


glutamine-fnjctose-6-phosphate transamin 


2.4 


130618 


AA383439 


Hs.16758 


Spir-1 protein 


15.9 


130667 


BE246961 


Hs.17639 


Homo sapiens ubiquitin protein r^ase (UBE3B) mRNA, pardal ods 


. 13.9 


130674 


AL048842 


Hs,194019 


attractin 


1.5 
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130675 AA442233 Hs.17731 hypothetical protein FU1 2892 5.4 

130692 AA652501 Hs.l3561 hypothetical protein MGC4692 5.0 

130893 R68537 Hs.17962 ESTs 2.0 

130712 AJ271881 Hs.279762 bromodomain-containing 7 1.8 

5 130714 AI348274 Hs.18212 DNA segment on chromosome X (uriique) 9879 expressed sequence 2.0 

130730 AB007920 Hs.18586 KIAA0451 gene product 3.7 

130744 H59696 HsA87A7 P0P7 (processing of precufsor, S. cerevisiae) homofog 3.1 

130751 AF052105 Hs.18879 chromosome 12 open reading frame 1.4 

130757 AL036057 Hs.18925 protein x 0001 5.7 

10 130768 AF258627 Hs.21 1562 ATP-binding cassette, sub-family A (ABC1), member 1 5.1 

1 30789 AK000355 Hs.8899 sirtuin (silent mating type infonnation regulation 2. S.cerevisiae, homolog) 5 5.2 

130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene famllyi member D 1.5 

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein. R binder family) 15.7 

130841 AL157468 Hs.325825 Homo sapiens cDNA FU20848 fis. done ADKAO 1732 2.8 

1 5 130843 AA447492 Hs.20183 ESTs. Weakly similar to AF164793 1 protein x 013 [H.sapiens] 1.5 

130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.4 

130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7 

130861 NM_016578Hs.20509 HBV pX associated proleln-B 1-9 

130879 NM_003416Hs,2076 zinc finger protein 7 (KOX 4. done HF.l 6) 1.4 

20 130880 BE514434 Hs.20830 kinesln^ke2 2.1 

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4 

130898 AB033078 Hs.186613 sphingosine-1 -phosphate lyase 1 1.7 

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog. subfamily A, member 2 1.8 

130919 N79110 Hs,21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3 

25 130944 BE382657 Hs.21486 signal transducer and activator of transcription 1,91 kD 5.4 

130971 N39842 Hs.301444 KIAA1673 2.2 

130992 BE398091 Hs.74316 desmoplakin (DPI, DPII) 1-8 

130993 T97401 Hs.21929 ESTs 1-6 
131Q05 AV658308 Hs.2210 thyroid hormone receptor interactor 3 1-6 

30 131028 A1879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP). gamma 1.2 

131042 A1826288 Hs.171637 hypothetical protein MGC2628 1.6 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys). member 10 7.4 

131046 AA321649 Hs.2248 small inducible (^okine subfamily B (Cy 3.0 

131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linked retiriopathy protein [H.sapiens) 1.7 
35 131060 AA194422 Hs.22564 myosin VI 5.1 

131060 AA194422 Hs.22564 myosin VI 2.5 

131070 N53344 Hs.22607 ESTs 7,1 

131076 AA749230 Hs.26433 dolichyl-phosphale (UDP-N-acetylgiucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0 

131076 AA749230 Hs.26433 dolichyl-phosphate(UDP-N-acetylglucosam 1.9 

40 131099 AL133353 Hs,226581 COXl 5 (yeast) homolog. cytochrome c oxWase assembly protein 7.0 

131174 NM 006540Hs.2gi31 nudear receptor coactivator 2 19 

131185 8E280074 Hs.23960 cydinBI ^-8 

131206 AW138839 Hs.24210 ESTs . 2,0 

131213 AA885699 Hs.24332 CGI>26 protein 7.0 

45 131225 H82087 Hs.31659 thyroid hormone receptor-assodated protein. 95-kD subunit 7.5 

131231 N47468 Hs.59757 zinc finger protein 281 2.9 

131233 D89053 Hs.268012 fatty-add-Coenzyme A ligase. long-chain 3 3.5 

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8 

131245 AL080080 Hs.24766 thioredoxin domain-oontaining 2.8 

50 131247 AL043iao Hs.326190 fatty acid amide hydrolase 5,6 

131281 AA251716 .Hs.25227 ESTs 5.7 
131283 X80038 Hs.339713 Homo sapiens done F19374 APO E-C2 gene cluster 1.3 
131305 AV656017 Hs.184325 CG 1-76 protein 5.0- 
131320 AA505691 Hs.145696 splldng factor (GC 1.3) 1.8 

55 131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 
131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6 
131375 AW293165 Hs.143134 ESTs 5.4 
131390 BE269388 Hs.182698 mitochondrial nljosomal protein 1^0 * 5.3 

131410 BE259110 Hs.279836 HSPC1 66 protein 2.2 

60 131412 NWL012247HS.124027 SEl£NOPHOSPHATE SYNTHETASE ; Human selenium donor protein 2.0 
131429 AUM6302 Hs.26750 hypothetical protein FLJ21 908 1.4 
131458 BE297567 Hs.27047 hypothetical protein FU20392 1.7 
131475 AA992841 Hs.27263 KIAA1458 protein 2.0 
131501 AV661958 Hs.8207 GKOOl protein 2.6 

65 131501 AV661958 Hs.8207 GKOOl protein 1.6 
131511 AA732153 Hs.27865 Homo sapiens cONA: FU21333 fis. done COL02535 2.0 
131528 AU076408 Hs.28309 UDP-gluoose dehydrogenase 16 
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131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4 

131543 AW966681 Hs.41639 programmed cell death 2 2J2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 2.1 
131562 NM^003512Hs.28777 H2A histone family, member L 1.7 

5 131564 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis. clone PLACE 1004405 5.1 

131564 193500 Hs.28792 Homo sapiens cDNA RJ1 1041 fis, clone PL 1.8 

131569 AL389951 Hs.271623 nucleoporin 50kD 5,0 

131618 BE393822 Hs.29545 Homo sapiens mRNA; cONA DKFZp761C029 (from clone DKFZp761C029); partial cds 1.8 

131622 R78195 Hs,29592 Homo sapiens cDNAaJ11436 fis. done HEMBA1001213 1.3 
10 131623 AB037791 Hs.29716 hypothetical protein FU 10980 2.2 

131623 AB037791 Hs.29716 hypothetical protein FU10980 1.9 
131643 AW410601 Hs.30026 HSPC182 protein 2.9 
131653 AW960597 Hs.30164 ESTs 1.3 
131656 AI218918 Hs.30209 KIAA0854 protein 2.8 

15 131669 X52486 Hs.3041 uracil-DNA glycosylase 2 2.8 

131692 BE559681 Hs.30736 KIAA0 124 protein 5.6 

131714 AA642831 Hs.31016 putative DNA binding protein 2.9 

131722 D13757 Hs.311 phosphofibosyl pyrophosphate amidotransferase 3.4 

131737 AKD01641 Hs.31323 inhibitor of kappa light polypepUde gene enhancer in B^lls. kinase ronriplex-assra 3.8 

20 131760 X76732 Hs.3164 nudeobindin 2 2.9 

131760 X76732 Hs.3164 nudeobindin 2 2.8 

131763 AI878932 Hs.317 topoisomerase (DNA) I 3,4 

131772 AA382590 Hs.170980 KIAA0948 protein 25.5 

131774 BE267158 Hs.169474 DKFZP586J01 19 protein 5.5 

25 - 131787 D87077 Hs.196275 KIAA0240 protein 2.4 

131793 AW966127 Hs.32246 Homo sapiens cONA FU14656 fis, done NT2RP2002439 7.9 

131795 BE5Q1849 Hs.32317 high-mobility group 20B 1.4 

131798 X86098 Hs.301449 adenovinis 5 El A binding protein 4.1 

131817 U20536 Hs.3280 caspase 6. apoptosis-related cysteine protease 4.2 

30 131824 U28838 Hs.32935 TATA box binding protein (TBP)-assodated factor. RNA polymerase HI. GTF3B subunit 2 3.5 
131850 AI251317 Hs.33184 ESTs - 5.1 

131878 AA083764 Hs.6101 hypothetical protein MGC3 178 5.8 
131885 BE502341 Hs,3402 ESTs 13.7 
131885 BE502341 Hs.3402 ESTs 2.4 

35 1 31 887 W17G64 Hs.332848 SWI/SNF related, matrix associated, actin dependent regulator of chronralin, subfamily e. member 1 3.2 
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961. mRNA. complete cds 8.7 
131900 AA099014 Hs.231029 Homo sapiens, clone MGC: 15961. mRNA, com 2.0 

131904 AF078866 Hs.284296 Homo sapiens cDNA: FU22993 fis. done KATl 1914 5,5 

131905 AA179298 Hs.3439 stomatin-like 2 11-3 
40 131913 AW207440 Hs.185973 degenerative spemnatocyte(homologDrosophila:Upiddesaturase) 1.7 

131916 AA025976 Hs.34569 ESTs 5.2 
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeast APC11 homotog) 2.7 ' 

131929 BE541211 Hs.34804 Homo sapiens cONAFU11472 fis, done HEMBA1001711 5.3 
.131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3 

45 131950 AA355113 Hs.35380 x OOl protein . 1.5 
131962 AK000046 Hs.267448 hypothetical protein FU20039 2.3 
131965 W79283 Hs.35962 ESTs 1-4 
131971 BE567100 'Hs.154938 hypothetical protein MDS025 3.5 
131977 U90441 Hs.3622 procollagenisrorme, 2-oxog!utafate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5 

50 131985 AA503020 Hs.36563 hypothetical protein FU224 18 2.4 
131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog). beta 2.1 
132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA. partial cds 3.2 
132031 AF193844 Hs.3758 C0P9 complex subunit 7a 5.8 
132052 BE265155 Hs.3832 clathrin-assodated protein AP47 1.5 

55 132084 NM.002267Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.7 
132103 BE171921 Hs.3991 ESTs 1-4 
132105 AV646076 Hs.39959 ESTs 5.8 
132116 AW960474 Hs.40289 ESTs 1-7 
132176 AA857025 Hs.8878 kinesin-like 1 33 

60 132180 NM_004460Hs.418 fibroblast activation protein, alpha . 14.7 
132192 AA206153 Hs.4209 mitochondrial ribosomal protein 1^7 5.5 
132194 R42432 Hs.4212 ESTs - 4.4 

132203 NM_0O47a2Hs.194714 synaptosomal-assodated protein. 29kD • • 2.2 
132207 BE205939 Hs.42287 E2F transcription factor 6 2.2 

65 132235 AV658411 Hs.42656 KIAA1681 protein 7.8 
132240 AB016324 Hs.42676 KIAA0781 protein 1*5 
132252 AI566004 Hs.141269 Homo sapiens cDNA: FU21550 ris,.done COL06258 1-3 
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132374 AF1 55582 Hs.46744 

132376 AI279892 Hs.46801 

132384 AA312135 Hs.46967 

132393 AL135094 Hs.47334 

132450 AA100012 Hs.48827 



132528 T78736 Hs.50758 
132530 AA308105 Hs.50785 
132532 AA454132 Hs.5080 
132534 BE388673 Hs.5086 
132543 BE568452 Hs.5101 
132571 AW674699 Hs.5169 
132574 AW631437 Hs.5184 
132596 AK001484 Hs.5298 

132611 AA345547 Hs.53263 

132612 H12751 Hs.5327 



132266 AA301228 Hs.43299 hypothetical protein FLJ1 2890 
132273 AA227710 Hs.43658 DKFZP586L151 protein 
132276 AA653507 Hs.285711 hypothetical protein FU13089 

132268 N36110 HS.30S971 solute canierfamily 2 (fadlitated glucose transporter), number 10 
132294 AB023191 Hs.44131 KIAA0974 protein 

132298 NM_015986Hs.7120 cytokine receptor-like molecule 9 

132299 AW405882 Hs.44205 cortistatin 

132325 N37065 Hs.44856 hypothetical protein FU1 211 6 

132348 AW067708 Hs,170311 heterogeneous nuclear ribonudeoprotein D-like 

132370 AW572805 Hs.46645 ESTs 

coral UDP-gaIactose:N-acetyigaIactosamine-atpha-Rbeta 1,3'galactosyltransferase 
sorting nexin 14 
HSPC034 protein 
hypothetical protein FU 14495 
hypothetical protein FU 12085 
132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 
132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 
132465 AW169847 Hs.49169 KIAAI 634 protein 
132470 AI224456 Hs.4934 H. sapiens polyA site DMA 
132484 X16660 Hs.1 19007 RAB4, member RAS oncogene family 
132518 AW885606 Hs.5054 ESTs 

SMC4 (stnjclural maintenance of chromosomes 4, yeast)-like 1 
SEC22, vesicle trafficking protein (S, cerevisiae)-iike 1 
mitochondrial ribosomal protein L16 
hypothetical protein MGC1 0433 
protein regulator of cytokinesis 1 
suppressor of G2 allele of SKP1 , S. oerevisiae, homolog of 
TH1 drosophaa homok)g 
C6I-45 protein 

hypothetk:al protein FU13287 
PR01914 protein. 
132616 BE262677 Hs.283558 hypothetical protein PR01 855 

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 
132648 U51127 Hs.54434 hypothetical protein MGC1 71 5 
132668 AB018319 Hs.5460 KIAA0776 protein 
132692 AW191962 Hs.249239 collagen, type Vlll. alpha 2 

132715 F11875 Hs.5534 Homo sapiens cDNA FU 12961 fis, done NT2RP2005545 
1 3271 8 NM_004600Hs.554 Sjogren syndrome antigen A2 (60kD, ribonudeoprotein autoantigen SS-A/Ro) 
132724 A1142265 Hs.55498 geranylgeranyl diphosphate synthase 1 
132731 All 89075 Hs.301872 hypothetical protein MGC4840 
132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 
132760 AA1 25985 Hs.56145 thymosin, beta, Wentified in neuroblastoma cells 
132771 Y10275 Hs.56407 phosphoserine phosphatase 
132773 AA459713 Hs.295901 KIAA0493 protein 
132784 AI142133 Hs.56845 GDP dissodation inhibitor 2 
KIAA0310 gene product 

muti- (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) 
KIAA0475 gene product 

solute carrier family 1 1 (proton-coupled divalent metal Ion transporters), member 2 
sex comb on midteg homolog 1 
tousled4ike kinase 2 
Hs.1 69610 CD44 antigen (homing function and Indian blood group system) 
Hs.57783 eukaryotic translation initiafion factor 3, subunit 9 (eta, 1 16kD) 
132842 NMJ)16154Hs.279771 Homo sapiens done PP1 596 unknown mRNA 
132844 F12200 Hs.5811 chromosome 21 open reading frame 59 
132851 U09716 Hs.287912 lectin, mannose-binding, 1 
132863 BE268048 Hs.236494 RABID, member RAS oncogene family 
132869 AW963217 Hs.203961 ESTs, Moderately similar to AF1 16721 89 PRQ21 68 (H.sapiens] 
132873 AW007683 Hs.58598 KIAAI 266 protein 

Rho-assodated. coiled-coil containing protein Idnase 2 
U2(RNU2) small nudear RNA auxiliary factor 1 (non-standard symbol) 
ring finger protein 15 
hypothetical protein FLJ10808 

132912 AW732760 Hs.167578 Homo sapiens cDNAFUl 1095 fis, clone PLACE 1005374 

132913 W78714 Hs.60257 Homo sapiens cDNAFU 13598 fis, done PLACE 1009921 

132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 

132941 AI817165 Hs.6120 hypothetical protein FU13222 
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132798 Ai026701 

132807 U07418 

132810 AB007944 Hs.5737 

132813 BE313625 Hs.57435 

132815 AI815189 

132817 N27852 

132821 AJ251595 

132833 U78525 



Hs.5716 
Hs.57301 



Hs.57475 
Hs.57553 



132875 NWL004850HS.58617 
132891 BE267i43 H5^9271 
132897 AW503667 Hs.59545 
132902 AI936442 Hs.59838 
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132942 AA554458 Hs.1 97751 KIAA0666 protein 1.8 

132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cell protein 0SC^6 mRNA. partial cds 2.2 

132962 AA576635 Hs.6153 CGM8 protein 4.9 

132972 M034365 Hs.288924 Homo sapiens cDNAFUl 1392 fe. clone HEMBA1000575 2.7 

5 ' 132973 AA035446 Hs.323277 ESTs 5.3 

132977 AA093322 Hs,301404 RNA binding motif protein 3 3.2 

132980 AA040696 Hs.62016 ESTs 1.3 

132994 Ml 12748 Hs.279905 clone HQ0310PRO0310p1 3.0 

133012 AA847843 Hs.827l1 Homo sapiens, done IMA6E:3351 295, mRNA 10,3 

10 133015 AJ002744 Hs.246315 UDP-N-acetyi-^lpha-D-galadosamineipolypepttde N-acetylgaiactosaminyltransferase 7 (^^ 2.1 

133016 AI439688 Hs.6289 hypometical protein FLJ20886 1.3 

133053 A1065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA. complete cds 6.0 

133062 AW500374 Hs.64056 PRO0149 protein 5.3 

133069 BE247441 Hs.6430 protein with pdyglutamine repeat caldum (ca24^) homeostasis endoplasmic reticulum protein 4.9 

15 133091 AK001628 Hs,64691 KIAA0483 protein 3.5 

133110 AA808177 Hs.65228 ESTs 13.1 

133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3 

1:^145 H94227 Hs.65g2 Homo sapiens, done IMAQE:2961368. mRNA. partial cds 2.2 

133152 Z11695 Hs.324473 mitogen-acttvated protein kinase 1 1.3 

20 133174 AA43i620 Hs.324178 hypothetical protein IVIGC2745 17.1 

133175 AW955632 Hs.66666 ESTs, Weakly similar to S19S60pro)ine-rich protein MP4- mouse [M.musculus] 1.8 

133177 X97795 Hs.66718 RAD54.{S.cerevisiae)-like 4.9 

133197 AI275243 Hs.180201 hypothetical protein FLJ20671 3.1 

133208 AI801777 Hs.6774 ESTs 4.4 

25 133226 AW954569 Hs.2g6287 Homo sapiens, Similar to bromodomain-containing 4, ctone IMAGE:3542455, mRNA. partial cds 1.7 

133228 AI492924 Hs.6831 golgi phosphoprotein 1 6.0 

133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 1.5 

133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA. partial cds 1.4 

133266 AI160873 Hs.69233 zinc finger protein .5.6 

30 133268 AW956781 Hs.293937 ESTs. Weakly similar to FXD2_HUIVIAN FORKHEAD BOX PROTEIN D2 [H.sapiens] 1.9 

133285 M76477 Hs.289082 GM2 ganglioside activator protein . 4.7 

133291 BE297855 Hs.69855 NRAS-related gene 5,0 

133314 AA102670 Hs.70725 gamma-aminobutyric add (GABA) A receptor, pi 27 

133321 T79526 Hs.179516 integral type I protein 9.3 

35 133327 AL390127 Hs.7104 Kmppel-like factor 13 4.4 

133347 BE257758 Hs71475 add cluster protein 33 1.8 

133360 AI016521 Hs71816 v-akt murine thymoma viral oncogene homolog 1 5.5 

133366 AA29281 1 Hs.72050 non-metastatic cells 5. protein expressed in (nucleoside-diphosphate kinase) 2.7 

133367 AF231919 Hs.18759 KIAA0539 gene product 17 
40 133370 AF245505 Hs.72157 DKFZP5641 1922 protein 1.8 

133383 BE313555 Hs.7252 KIAA1224 protein 1J 

133390 A1950382 Hs72860 phosphatidylserine receptor .1.3 

133391 AW103364 Hs.727 inhibin. bete A (activin A. activin AB alpha polypeptide) ' 16.1 
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2 

45 133437 AL031591 Hs.7370 phosphotidylinositol transfer protein, beta 10.4 

133452 NM_002759Hs.274382 protein kinase Jnterteron-indudbte double stranded RNA dependent 1.2 

133453 AI659306 Hs73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FU22237 11.1 
133529 W45623 Hs.74571 ADP-ribosylation factor 1 2.8 

50 133540 AL037159 Hs74619 proteasome (prosome, macropain) 268 subunit non-ATPase, 2 2.9 

133543 AU077073 Hs.108327 damage-specific DN A binding protein 1 (127kD) 2.5 

133578 AU077050 Hs75066 transfin 1.5 

133579 X75346 Hs75074 mitogen-activated protein kinase-adivated protein kinase 2 2.1 
133582 BE391579 HsJ5087 Fas-activated serine/threonine kinase 1.3 

55 133594 AW160781 Hs.172589 nudear phosphoprotein similar to S. cerevisiaePWPI 2.2 

133595 AA393273 Hs75133 transcription fador 6-like 1 (mitochondrial transcription factor 1 -like) 1.5 

133599 N^L002885Hs75151 RAP 1, GTPase activating protein 1 57 

133621 NWL004893HS.75258 H2A histone family, member Y 25.5 

133627 NM_002047Hs75280 glycyl-tRNA synthetase 15.8 

60 133631 NIVL000401HS.75334 exostoses (multiple) 2 3.3 

133649 U25849 Hs.75393 acki phosphatase 1. soluble 1.6 

133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4.1 

133720 1^7841 Hs.75737. pericentrioiar material 1 1.5 

133722 AW969976 Hs.279009 matrix Gla protein 6.3 

65 133751 AW402048. Hs.334787 Homo sapiens. Similar to fikely ortholog of yeast ARV1 . clone lt\4AGE:3506392. mRNA 3.9 

133757 T52946 Hs.196209 RAE1 (RNA export 1. S.pombe) homolog 1.7 

133760 BE271766 Hs.181357 (aminin receptor 1 (67kD. ribosomal protein SA) 1.8 
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133765 M62194 Hs.75929 cadherin 11.type2.0B-cadherai(osteoblasO 1-5 

133780 AA557660 Hs.76152 decorin 3.5 

133784 BE622743 Hs.301064 arfaptin 1 6-8 

133791 M34338 Hs.76244 spermidme synthase 2,6 

5 133797 AL133921 Hs.76272 retinoblastoma-blndlng protein 2 14 

133822 D50525 Hs.699 peptidylprotyi isomeiase B (cydophirn B> 3.0 

133842 AW797468 Hs.285013 putative human HLA class II associated protein I 13.5 

133845 AA147026 Hs.76704 ESTs 2.2 

133850 W29092 Hs.7678 ceJlularretinoicadd-binding protein 1 1o 

10 133859 U86782 Hs.178761 26S proteasome-assodated pad1 homolog 2.0 

133865 AB011155 Hs.170290 discs Jarge prosophila) homolog 5 2.8 

133867 AW340125 Hs.76989 KIAA0097 gene product 6 ' 

133868 AB012193 Hs.183874 cullin4A 2.5 
133881 U30872 Hs.77204 centromere protein F (350M00kD, mitosin) • 3.0 

15 133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 1-4 

133924 D86326 Hs.325948 vesicle docldng protein p1 15 5.4 

133929 NM_006306Hs.211602 SMC1 (stniCtural maintenance of chromosomes 1» yeast)-like 1 4.9 

133936 L17128 Hs.77719 gamma-glutamyt carboxylase 37 

133941 BE244332 Hs.77770 adaptor-related protein complex 3. mu 2 subunit 12.1 

20 133959 X81789 Hs.77897 splidng factor 3a. subunit 3. 60kD 9-7 

133976 AI908165 Hs.169946 GATA-binding protein 3 {T-ceD receptor gene adivator) 3.1 

133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3 

133997 A1824113 Hs.78281 regulatorofG-protein signalling 12 9-7 

134010 AB016092 Hs.197114 RNA binding protein; AT-ridi element binding fador 2.4 

25 - 134015 D31764 Hs.278569 sorting nexin 17 2.5 

134070 NM_003590Hs.78946 cullln 3 ]-3 

134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated 4.2 

134129 NM_014742Hs.79305 KIAA0255 gene product 2.2 

134134 H86504 Hs,173328 protein phosphatase 2. regulatory subunit B (B56). epsilon isofomi 5.0 

30 134200 BE559598 Hs.197803 KIAA01 60 protein 3.2 

134206 AF107463 Hs.79968 splicing fador 30, survival of motor neuron-related 2.5 

134208 NM_000288Hs.79993 peroxisomal biogenesis factor 7 2.1 

134219 NM 000402HS.80206 glucose-6-phosphate dehydrogenase Q ** 

134234 BE3b0D78 Hs.80449 Homo sapiens, done 1MAGE:3535294. mRNA. partial cds 2.6 

35 134275 AI878910 Hs.3688 clsplafin resistance-assodated overexpressed protein 18 

134292 AI906291 Hs.81234 immunoglobulin superfamBy. member 3 2.0 

134301 AW502505 Hs.81360 Homo sapiens cDNA: FU21927 fis. done HEP04178. highly similar to HSU90909 2,5 

134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2.8 
134324 AB029023 Hs.179946 KtAA1 100 protein lO-^ 

40 134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 1-9 

134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2,6 

134337 NM^004922Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2,3 
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gpl 30. oncostatin IVI receptor) 13.0 
134367 AA339449 Hs.822B5 phosphoribosylglydnamide fomiyltransferase, phosphoribosylglydnamide synthetase, 8.8 

45 134376 X06560 Hs.82396 2',5'-ol!goadenylate synthetase 1 (4046 kD) ^-5 
134379 AW362124 Hs.323193 hypothetical protein MGC3222 8-1 
1 34384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1 , done IMAGE:3639252» mRNA. partial cds 2.6 
134391 AA4173B3 Hs.82582 Integrin. beta-like 1 (v/ith EGF-like repeat domains) 4.1 
■134395 AA456539 Hs.8262 lysosomal 

50 134403 AA334551 Hs,82767 spenn specific antigen 2 2.6 
' 134405 AW067903 Hs.82772 collagen, type XI, alpha 1 '-3 

134411 BE272095.HS. 167791 retkxjiocalbin 1, EF-handcaldum binding domain 3.2 
134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1-9 
134421 AU077198 Hs.82985 collagen, type V, alpha 2 ^**'3 

55 • 134424 Z44190 Hs.83023 peroxisomal biogenesis factor 1 1B 2.4 

134446 AA112036 Hs.83419 KIAA0252 protein '*-2 

134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 {p105) 1.6 
134470 X54942 Hs.83758 GDC28 protein kinase 2 2.1 
134480 NM^005000Hs.83916 Empirically selected from AFFX single probesel 5.3 

60 134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 2,5 
134498 AW246273 Hs.84131 threonyMRNA synthetase 2.1 
134513 AA425473 Hs.84429 KIAA0971 prole&l 3.8 
134516 AKD01571 Hs.273357 hypothetical protein FU 10709 2.4 
134520 BE091005 Hs.74861 activated RNA polymerase tltranscrtptioncofactor 4 6,7 

65 134529 AW411479 Hs.848 FK508-b{nding protein 4 (59kD) 2,3 
134577 BE244323 Hs.85951 exportin, tRNA(nudear export receptor for tRNAs) 5.5 
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
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134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37 2.2 

134624 AF035119 Hs.8700 deleted in fiver cancer 1 2.0 

134632 X78520 Hs.174139 chloride channel 3 2.3 

134654 AK001741 Hs.8739 hypothetical protein RJ10879 1-4 

5 134664 AA256106 Hs.87507 ESTs ■ 72.9 

134666 BE391929 Hs.8752 transmembrane protein 4 8.5 

134687 U62317 Hs.88251 arylsulfatase A 6,0 

134692 NM_.003474Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) 4,3 

134705 BE1 61887 Hs.88799 anaphase-promoting complex subunit 10 2.3 

10 134714 Y14768 Hs.890 lysosomal 6.7 

134719 AA852985 Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha) 2.3 

134722 AF129536 Hs.284226 F-box only protein 6 2.9 

134724 AF045239 Hs.321576 ring finger protein 22 6.6 
134746 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell receptor ' 2,3 

15 134751 AW630803 Hs.89497 laminBI 6.2 

134790 BE002798 Hs.287850 integral membrane protein 1 1-9 

134806 AD001528 Hs.89718 spemiine synthase ' 1-8 • 

134834 AW451 370 Hs.8991 adaptor-related protein complex 1. gamma 2 subunft 1.4 

134850 AI701162 Hs.90207 hypothetical protein MGC11 138 1.4 

20 134853 6E268326 Hs.90280 5-aminoHTiidazole-4-carboxamkle ribonudeotide fomiyltFansferase/IMP cyctohydrolas^ 5.6 

134859 D264B8 Hs.90315 KIAA0007 protein 2.8 

134880 A187gi95 Hs.90606 1 5 kDa selenoprotein 1-7 

134910 AA532963 Hs.9100 Homo sapiens cDNAFU13100fis. done NT2RP3002255 17 
134925 AW885909 Hs.6976 PR01 073 protein . 2.1 

25 134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A). catalytic subunit, alpha isofonn 1.3 

134971 AI097346 Hs.286049 phosphoserine aminotransferase 2.1 

134975 R50333 Hs.92186 Leman coiled-coll protein 2.3 

135011 AB037835 Hs.92991 KIAA14 14 protein 1-6 

1 35022 NM_000408Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial) 3.9 

30 135032 AW301984 Hs.173685 hypothetical protein FL J 126 19 6.2 

. 135077 AW503733 Hs.9414 K1AA1488 protein 2,0 

1 35083 AB036063 Hs.94262 pSS-indudble ribonudeotide reductase small subunit 2 homolog 1 .3 

135095 AF027219 Hs.9443 zinc finger protein 202 7.1 

135096 AA081258 Hs. 132390 zinc finger protein 35 (KOX 18) 3.2 
35 135153 A1093155 Hs.95420 JM27 protein 2.5 

135181 BE250865 Hs.279529 pxl9-like protein 1.4 

135199 AA477514 Hs-95247 translin-assodated factor X 5.0 

135207 N26427 Hs.9634 ESTs. Highly similar to C10_HUMAN PUTATIVE CIO PROTEIN (H.sapiensl 6.1 

135214 T78802 Hs.96560 hypothe«cal protein FLJ1 1656 4.6 

40 135243 BE463721 Hs.97101 putative G protein-coupled receptor 5.6 

. 135245 AI028767 Hs.262603 ESTs 3.5 

135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-llnked retinopathy protein [H.sapiens] 1.2 

135263 A108B775 Hs.55498 geranylgeranyl diphosphate synthase 1 2.6 

135274 AA448460 Hs.1 12017 GE36gene. 5.3 

45 135294 AA150320 Hs.9800 protein kinase Njmu-RI 9.1 

135295 AI090838 Hs.98006 ESTs 2.4 

135307 AI743770 Hs.98368 ESTs. Weakly similar to K1AA0822 protein [H.sapiens] 13^ 

135321 AI652069 Hs.98614 ribosome binding protein 1(dog 180kO honiolog) .2.6 

135354 AA456454 Hs.183418 celt division cyde 2-Uke 1 (PITSLRE proteins) 8.3 

50 135361 AA373452 Hs.167700 Homo sapiens cDNAFUl 0174 fis. done HEMBA1003959 1.5 

135389 U05237 Hs.99872 fetal Alzheimer antigen 4.9 

135400 X78592 Hs.99915 androgen receptor (dihydroteslosterone receptor; tesficularfeminizatran; spinal and bulbar 2.0 

134975 R50333 Hs.92186 Leman coiled-coil protein 2.6 

• 135011 AB037835 Hs.92991 KIAA14 14 protein 1-4 

55 135022 NWL000408HS.93201 glycerol-3-phosphate dehydrogenase 2 (mi 1.6 

135032 AW301984 Hs.173685 hypothetical protein FU12619 1-4 

135077 AW503733 Hs.9414 KIAA1488 protein 1-6 

135083 AB036063 Hs.94262 p53Hndudble ribonucleoUde reductases 2^ 

135095 AF027219 Hs.9443 zinc finger protein 202 ^ -^ 

60 135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18) 2.1 

135153 AI093155 Hs.95420 JM27 protein 4.4 

135181 BE250865 Hs.279529 pxl9-0ke protein 14.9 

135199 AA477514 Hs.96247 tfansfin-assodated factor X 1-3 

135207 N26427 Hs.9634 ESTs. Highly similar to C10_HUMANPUTATI 1.7 

65 135214 T78802 Hs.96560 hypothetical protein FU1 1656 6.1 

135243 BE463721 Hs.97101 putative G protein-coupled receptor 2.7 
135245 AI028767 Hs.262603 ESTs 12.2 
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65 



135257 AW291023 Hs.97255 ESTs. Weakly similar to A46010 X-linked 
1 35263 AI088775 Hs,55498 geranylgeranyl diphosphate synthase 1 
135274 AA448460 Hs.l 12017 GE38gene 

135294 AA1 50320 Hs.9800 protein kinase Njmu-RI 

135295 A1090838 Hs.98006 ESTs 

1 35307 AI743770 Hs.98368 ESTs. Weakly similar to KIAA0822 protein 

135321 A1652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 

135354 AA456454 , Hs.183418 cell division cyde 2-like 1 (PITSLRE pr 

135351 AA373452 Hs.167700 Homo sapiens cDNAFU10174ns, done HE 

135389 U05237 Hs.99872 fetal Alzheimer antigen 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 

302256 AA857131 Hs.171595 HIV TAT spedfic factor 1 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2. 

303135 AW592789 Hs.279474 HSPC070 protein 

303686 AK000714 Hs.109441 MSTP033 protein 

310085 R43191 Hs.101248 Homo sapiens done IMAGE:32553. mRNA seq 

315518 - AA808229 Hs.167771 ESTs 

317781 NM„007057Hs.42650 ZW10 interactor 

320836 AI268997 Hs.197289 rab3 GTPase-activatIng protein, non-cata 

321114 AA902256 Hs.78979 Golgi apparatus protein 1 

322221 N24236 Hs.l79662 nudeosome assembly protein 1 -like 1 

322474 AF118083 Hs.29494 PR01912 protein 

322556 BE041451 Hs.177507 hypothetical protein 

323541 AF292100 Hs.104613 RP42homolog 

407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimldazo 
Hs.43627 SRY (sex detemnining region Y)-box 22 
Hs.48295 RNA helicase family 
Hs.101617 ESTs, Weakly similar to T32527hypotheti 
hypothetical protein, exijressed in osteo 
SFRS protein kinase 1 
UBX domain-containing 1 
high-mobility group (nonhistone chromoso 
TATA box binding protein (TBP)-assodate 
cathepsin K (pycnodysostosis) 
retinobtastoma-binding protein 6 
alpha thaiassemia/mental retardation syn 
v-myb avian myeloblastosis viral oncogen 
421225 AA463798 Hs.102696 MCT-1 protein 
421642 AF172066 Hs.106346 retlnoic add repressible protein 
421828 AW891965 Hs.279789 histone deaoetylase 3 
421983 AI252640 Hs.1 10364 pepfidylprolyl isomerase C (cydophilin 
422052 AA302744 Hs.104518 ESTs 
422055 NM_014320Hs,111029 putative heme-blnding protein 
423750 AF165883 Hs.298229 prefoldin2 

424001 W67883 Hs. 137476 paternally expressed 10 (PE610: KIAA105 

425182 AF041259 Hs.155040 zinc finger protein 217 

425284 AF155568 Hs.155489 NS1 -associated protein 1 

426372 BE304680 Hs.16953l DEAD/H (Asp-GIu-Ala-Asp/His) box polypep 

428049 AW183765 Hs.182238 GW128 protein 

428477 AW500533 Hs.1 1482 splicing fartor. arglnine/serine-rich 11 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
Homo sapiens hepatocellular cardnoma-as 
Homo sapiens done 23736 mRNA sequence 
skbl (S. pombe) homolog 
446999 AA151520 Hs.334822 hypothetical protein MGC4485 
447111 A1017574 Hs.17409 cysteine-rich protein 1 (intestinal) 
447778 BE620592 Hs.71190 ESTs, WeaWy similar to SI 6508 hypofheti 
448873 NM_003677Hs^393 density-regulated protein 
449687 W68520 Hs.331328 intennediate filament protein syncoifin 
450701 H39960 Hs.288467 Homo sapiens cDNA FU 12280 fis, done MA 
450703 AA011202 Hs.184771 nuclear factor l/C (CCAAT-binding transc 
452461 N78223 Hs.108106 transcripfion factor 
452511 BE408178 Hs.285165 Homo sapiens cDNA FU20845 fis. done AD 
453157 AF077036 Hs.31989 DKF2P586G1722 protein 
453658 BE541906 Hs.87819 Homo sapiens, done MGC:2492. mRNA. comp 
100833 AF135168 Hs.108802 N-elhylmale^'de-senslfiva factor 
102481 U50360 gb:Human caldum, calmodulin-dependent p 
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408196 AL034548 

408813 AI580090 

409176 R73727 

413670 AB000115 Hs75470 

414108 AI267592 Hs75761 

414846 AW304454 Hs.77495 

416980 AA381133 Hs.80684 

417378 R57256 Hs.82037 

418283 S79895 Hs.83942 

418467 NM 006910HS.85273 

420269 U72937 Hs.96264 

420802 U22376 Hs.1334 



437562 AB001636 Hs.5683 

438449 AK001333 Hs.6216 

441560 F13386 HsJ888 

445580 AF167572 Hs.12912 



7.6 

1.8 

4.1 

1.2 

4.8 

5.8 

12.3 

5.7 

7.9 

1.9 

13.9 

5.3 

2.2 

1.4 

5,2 

2.3 

2.8 

2,0 

5.5 

1.4 

1.3 

2.9 

1.6 

1.8 

1.6 

6.1 

5.6 

2.6 

2.4 
1.5 

4.2 

23.6 

5.8 

1.3 

1.6 

2.3 

1.6 

3.5 

4.9 

3.1 

1.9 

2.4 

4.1 

7,0 

4.9 

3,4 

2.1 

7.5 

1.7 

2.4 

3.8 

5.6 

2.0 

7.5 

2.2 

2.8 

1.7 

5.9 

5.6 

1,4 

4.7 

2.9 

12.1 

4.7 

1.3 

3.2 

6.2 
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102827 
103549 
104331 
110018 
115008 
119075 
119615 
125006 
127609 
129209 
129917 
130182 
130365 
131135 
131853 
131881 
132726 
135193 
409487 
416040 



BE244588 Hs.6456 
BE270465 Hs,78793 
AB040450 Hs.279862 
AW579842 Hs. 104557 
AK001827 Hs.87889 



M10905 

AL034423 

BE065136 

X80031 

R62676 

M30773 

BE267033 

W56119 



Hs.287820 

Hs.75875 

Hs. 145696 

Hs.530 

Hs.17820 

Hs.278540 

Hs.192853 

Hs.155103 



NfvL016569Hs.267182 
AI681917 Hs.3321 
AW361018 Hs.3383 
N52298 Hs.55608 
X95525 Hs.96103 
H19886 

AWS19158 Hs.289044 



chaperonin containing TCP1, subunit 2 (b 

protein kinase C, zeta 

cdk inhibitor p21 binding protein 

hypothetical protein RJ 10697 

helicase-moi 

fibronectin 1 

ubiquitin-conjugating enzyme E2 variant 
splicing factor (CC1.3) 
collagen, type IV, alpha 3 (Goodpasture 
Rho-associated, coiled-coil containing p 
protein phosphatase 3 (fomierly 2B), reg 
ubiquitin-conjugating enzyme E2G 2 (homo 
eukaryotic translation initiation factor 
TBX3mso protein 

ESTs. Highly similar to IRX1_HUMAN IROQU 
upstream regulatory element binding prot 
hypothetical protein MGC955 
TATA box binding protein (TBP)-assQctate 
gb:yn57a05.ii Soares adult brain N2b5H 
Homo sapiens cDNA FU12048 fis. done HE 



7.9 
2,0 
5.3 
2.0 
5.7 
1.3 
2.9 

I. 7 
2.4 
5.2 
4,5 

II. 0 
3,3 
1.3 
3.2 
14.3 
3.0 
2.7 
2.3 
7.4 
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Table 4 A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column, 

Pkey: Unique Eos probeset identifier number 

GAT number Gene clusternumber . 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



123615 
123619 
101445 
124385 
124417 
124482 
102481 
103349 
110856 



120280 
113248 
120472 



30886^-15 

3716B1_1 

1650^-5 

656394J 

1642364J 

1657509J 

31281^-28 

11052.-2 

19346J4 . 



160212 J 
328626_1 
44573 2 



129019 44573.2 



120695 9683.3 



122188 
121581 
122618 
109026 
123658 
123811 
125115 
125147 
118737 
120274 
113196 
120504 
120639 



275673J 
283769J 
305217_1 
150431 1 



AA609170 

AA6Q2964AA609200 
M21259 

AI267847 N27351 
N34059 N46979 
N53935 N53950 
U50360 
X89059 

AA992380 N33063 N21418 H79958 R21911 H79957 

103797 109699_1 AA080912 AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA113913 AA113892 
AA083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305 

AA190577AA181657 

T63857 AW971220 AA493469 T63699 

AI950087 N7020a R97040 N36809 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 At820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 A1950344 AI741346 Ai689062 AA282915 AW102898 A1872193 AI763273 AW173586 AW150329 AI653832 
A1762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538AA1800O9 AA337499 AW961101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 A1373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 A1251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 Ai560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488954 
AA283144 AI890387 Aig50344 AI741346 AI689062 AA282915 AW102898 AI872193 Ai763273 AW173586 AW150329 AI653832 
AI762688 AA988777 AA488892 AI356394 AW103813 A1539642 AA642789 AA856975 AW505512 AI961 530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW9611 01 AA251669 AA251874 
AI819225 AW205852 AI683338 AIB58509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 A1002839 R67840 AA3a0207 AW959581 T63226 F04005 

AA976503 A1917802 AA953664 AA404613 AA428771 BE280542 AW194691 A1927301 A1740458 AI796100 AI935603 AW052210 

AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 

AA398838AA435847 

AA416568 AA442889 AA417233 AA442223 
AA453641 AA454061 



AA157811 AA836869 
genbanK_AA609354 AA609364 
genbank^AA620586 AA620586 
genbank.T97341 T97341 
NOTJ^OUND.entre2^W38150 
382979J AA199686 N73861 
genbanH^AA177051 AA177051 
genbank_T57317 T57317 
genbanK.AA256837 AA256837 
genbanKJ\A286942 AA286942 



W38150 
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120809 genbanIcAA346495 AA346495 
113702 genbank_.T97307 T97307 

129680 23162_1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 A1741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D531 19 D54729 D55504 D55377 D55313 AW512244 AAB46441 
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441 
AI803081 All 67381 AW245389 AA319430 AA335156 AID42646 AA327030 AA725170 T27943 AA889304 AA976699 AI687001 
A1621 107 AI865540 AA772107 C06286 AA319661 AA405992 



101045 


entrezJ05614 J05614 


117247 


genbank 


N21032 


110501 


genbank 


H55748 


103392 


entrez_X94563X94563 


105032 


genbank 


AA127818 


119513 


NOT.FOUND.entrez W37933 


105445 


genbank 


AA252395 


121514 


genbank 


AA412112 


121558 


genbank 


AA412497 


121911 


genbank 


AA427950 


123315 


714071_1 


AA496369AA496646 


114911 


genbank 


AA236672 


409487. 


1134778.1 


H198B6AW402808T10231 
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TABLE 5: Figure 5 from BRCA 001 US 
Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue. 
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Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene nimber 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to nomial breast tissue 
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Pkey ExAccn UnigenelD UnigeneTitle 



100114 
100147 
100154 
100335 
100666 
100567 
100668 
100678 
100988 
101031 
101045 
101332 
101352 
101580 
101592 
101767 
101806 
101810 
101983 
102107 
102165 
102198 
102217 
102220 
102302 
102348 
102374 
102455 
■ 102568 
102618 
102687 
102689 
102704 
102705 
102801 
102827 
103060 
103080 
103178 
103206 
103238 
103547 
103549 
103886 
104325 
104827 
104846 
104854 
104887 



Hs.82962 

Hs.136348 

Hs.81892 



X02308 
D13666 
H60720 

AW247529 Hs-6793 
L05424 Hs.169610 
L05424 Hs.169610 
L05424 Hs,169610 
AW502935 Hs.740 
AK000405 HS76480 



J05070 
J05614 
J0408d 
AI494299 



Hs.151738 

Hs.156346 
Hs.16297 
NM_012151Hs.83363 
AF084B53 Hs.91299 
M81057 Hs.180884 
AA586894 Hs.1 12408 
NM^000318Hs-180612 
AI904232 Hs.75323 
BE258602 Hs.182366 
BE313280 Hs.159627 
AW950852 HsJ4598 
AA829978 Hs.301613 



U24389 

AA3a6342 

U37519 

U33635 

U48705 

W81489 

AL037672 



Hs.65436 

Hs.69171 

Hs.87539 

Hs.90572 

Hs.75562 

Hs.223025 

Hs.81071 



NM_007019Hs.93002 



U96132 

AU077058 

T97490 

BE252241 

BE244588 



Hs.171280 

Hs.54089 

Hs.50002 

Hs.38041 

Hs.6456 



NM_005940Hs.155324 
AU077231 Hs.82932 
AA205475 Hs.275865 
X72755 Hs,77367 
AI369285 Hs.75169 
AI376722 Hs.180062 
BE270465 Hs.78793 
AKD01278 Hs.105737 
BE379766 H5.150675 
AW0S2006 Hs.8551 
AI250789 Hs.32478 
AA041276 Hs. 154729 
AA278898 Hs.225979 



R1 

thymidylate synthetase 2.9 

osteoblast spedfic factor 2 (fasclclin 7.5 

KIAA0101 gene product 9.2 

platelet-activating factor acetylhydrola 2.7 

CD44 antigen (homing function and Indian 5.7 

CD44 antigen (homing function and Indian 9 

CD44 antigen (homing function and Indian 7.6 

PTK2 protein tyrosine kbase 2 53.2 

ubiquitin-like 4 11.4 

matrix metadopfDleinase 9 (gelafinase B 8.2 

gb:Human proliferating cell nuclear anti 5 

topoisomerase (DNA) II alpha (170kD) 3.4 

C0X17 (yeast) homotog, cytochrome c oxid 6.3 

coagulation factor Vlll-associated (Intr 5.7 

guanine nucleotide binding protein ( 5.6 

carboxypeptkiase B1 (tissue) 14.4 

S100 cak^um-binding protein A7 (psorias 8.9 

peroxisomal membrane protein 3 (35kD, Ze 3.2 

prohibitin 8.4 

heat shock protein 75 1 .4 

death associated protein 3 4.6 

polymerase (DNA directed), delta 2, regu 4.3 

JTV1 gene 6.7 

lysosomal 4.3 

protein kinase C-like 2 2.7 

aldehyde dehydrogenase 3 family, member 2 

PTK7 protein tyrosine kinase 7 6.2 

discoidin domain receptor family, member 6.9 

RAB31, member RAS oncogene family 5.3 

extracellular matrix protein 1 5.8 

ubiquitin carrier protein E2'C 4.3 
hydroxyacyl-Coenzyme A dehydrogenase, ty 6 

BRCA1 associated RING domain 1 1 .9 

small inducible cytokine subfamily A (Cy 2,3 

pyridoxal (pyridoxine. vitamin B6) kinas 6.4 

chaperonin containing TCP1 . subunit 2 (b 5.6 

matrix metalloprotelnase 1 1 (MMP1 1 ; stro 4.5 

cydin D1 (PRAD1: parathyroid adenomatos 3,1 

ribosomal protein S18 9.9 

monokine induced by gamma interferon 8.8 
death-assodated protein 5.6 
proteasome (prosome, macropain) siibunit. 9.7 
protein kinase C, zeta 7.9 
hypothetical protein F1J1041 6 similar to 6.5 
polymerase (RNA) II (DNA directed) polyp 6.3 
PRP4/STKAWDspIiclng factor 10.9 
ESTs 5.6 
3-phospholnositide dependent protein kin 1 2.3 
hypothetical protein similar to small ,G 2 
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104896 AW015318 Hs.23165 ESTs 17.7 

104909 AW408164 Hs.249184 transcription factor 19 (SCI) 5 

104916 AW958157 Hs.155489 NSl -associated protein 1 1.7 

104919 AA026880 Hs.25252 prolactin receptor 1-4 

5 104974 Y12059 Hs.278675 bromodomain-containtng 4 1.4 

104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 7,2 

105012 AF098158 Hs,9329 chromosome 20 open reading frame 1 3.3 

105039 AA907305 Hs.36475 ESTs 2.5 

105079 AA151342 Hs.12677 CGI-147 protein 9.5 

10 105088 H58589 Hs.35156 Homo sapiens cDNAFU 11 027 fis, clone PL 2.2 

105393 AF167570 Hs.256583 interieuldn entiancer binding factor 3. 9 5.4 

105547 AA262640 Hs.27445 unknown 9.3 

105564 BE616694 Hs.288042 hypothetical protein FLJ14299 14 

105658 AA985190 Hs.246875 hypothetical protein FU20059 9.4 

15 105746 AW151952 Hs.46679 hypothetical protein FU20739 1-5 

105858 AF151066 Hs.281428 hypothetical protein 2.9 

105930 AF016371 Hs.9880 pepUdyl prolyl isomerase H (cyclophilin 5.2 

106094 AA533491 Hs.23317 hypothetical protein FU 14681 6.8 

106350 AK001404 Hs.l94698 cyclinB2 5.7 

20 106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6,3 

106610 AA458882 Hs.79732 fibulinl 7.9 

106624 NI\/L003595Hs.26350 tyrosylprotein sulfotransferase 2 7.7 

106713 BE614802 Hs.184352 hypothetical protein FU 12549 4.5 

106829 AW9598g3 Hs.27099 hypothetica] protein FU23293 similar to 16.2 

25 106846 AB037744 Hs.34892 KIAA1 323 protein 2.2 

106873 N49809 H3.11197 Homo sapiens, done 1MAGE:33431 49. mRN A. 16.8 

108973 BE156256 Hs.11923 hypothetical protein 6.6 

107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem 1.8 

107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1 

30 107859 AW732573 HS.475B4 potassium voltage^gated channel, detayed 8.4 

107901 L42612 Hs.335952 keratin 6B 2.5 

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2 

107974 AW956103 Hs.61712 pynjvate dehydrogenase kinase, isoenzyme 6.7 

108274 AF129535 Hs.272027 F-box only protein 5 7.1 

35 108647 BE545947 Hs.44276 homeoboxCIO 9.8 

108695 AB029000 Hs.70823 KIAA1 077 protein 7.2 

108894 AK001431 Hs.5105 hypothetical protein FU 10569 4 

109011 AA156542 Hs.72127 ESTs 1.4 

109068 AA164293 Hs.72545 ESTs 2.9 

40 109273 AA375752 Hs.82719 Homo sapiens mRN A; cDNA DKFZp586F 1822 (f 2.9 

109468 NM_0153l0Hs.6763 KIAA0942 protein 3.2 

1 10240 AI668594 Hs.1 76588 ESTs. Weakly similar to CP4Y_HUMAN CYTOC 4.2 

110330 AI288666 Hs.16621 DKFZP43411 16 protein 6.2 

110501 H55748 gb:yq94a01 .si Scares fetal liver spleen 6.1 

45 110742 AW1 90338 Hs.28029 hypothetfcal protein MGC1 1256 7.6 

1 10762 BE044245 Hs,3001 1 hypothettaal protein MGC2953 9.3 

1 10856 AA992380 gb:ot37g06.s1 SoaresJeslis.NHT Homo sap 2.3 

110958 NM.005864HS.24587 signal transduction protein (SH3 contain 6.7 

111125 N63823 Hs.269115 ESTs. Moderately similar to Z195.HUMANZ 3.6 

50 111179 AK000136 Hs,10760 asporin (LRR class 1) 7.1 

111239 N90956 Hs.17230 hypothetical protein FU22087 7.9 

111285 AA778711 Hs.4310 eukaiyotic translation initiation factor 6.9 

111392 W46342 Hs.325081 Homo sapiens, done IMAGE:3659680, mRNA. 8.4 

111937 BE298655 Hs.14846 Homo sapiens mRNA; cDNADKFZp564D016(fr 10.6 

55 112244 AB029000 Hs.70823 K1AA1077 protein 14.6 

112995 AA737033 Hs.7155 ESTs, Moderately similar to 21 15357APrK 5.6 

113777 BE265947 Hs.10590 zinc finger protein 313 13.4 

113791 AI269095 Hs.1 35578 chitobiase. di-N-acetyl- 1-3 

113811 BE207480 Hs.6994 Homo sapiens cDNA: FU22044 fis. done H 3.1 

60 113834 T26483 Hs.6059 EGF-contalning fibulin-Uke extracellula 11.3 

113868 W57902 Hs.90744 proteasome (prosome, macropaln) 26S subu 2.7 

113870 AL079314 Hs.16537 hypothetical protein. simOar to (U06944 6.1 

113923 AW953484 Hs.3849 hypothetical protein FU22041 similar to 1.9 

114275 AW515443 Hs.306117 KIAA0305 protein 15.8 

65 114895 AA236177 Hs.76591 KIAA0887 protein 7.1 

114965 Ar733881 Hs.72472 BMP-RIB 2.3 

115061 AI751438 Hs.41271 Homo sapiens mRNA full length insert cON 11.8 
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115278 AK002163 Hs.301724 
115291 BE545072 Hs.122579 
115652 BE093589 Hs.38178 
115693 AF231023 Hs.55173 
5 115941 AI867451 Hs.46679 
115968 AB037753 Hs.62767 
116011 AL359053 Hs.57664 
116417 AW49g664 Hs.12484 
116470. AI272141 Hs.83484 

10 116637 AK001043 Hs.92033 
117132 AI393666 Hs,42315 
117881 AF161470 Hs,260822 
118528 A1949952 Hs.49397 
119075 M10905 Hs.287820 

15 119265 BE539706 Hs.285363 
119349 T65004 Hs.163561 
119403 AL1 17554 Hs.1 19908 
119789 BE393948 Hs.50915 
120206 H26735 Hs.91668 

20 120253 AA131376 Hs.326401 
120297 AA191384 Hs.104072 
120325 AA195651 Hs.104106 
120327 AK000292 Hs.278732 
120349 AW969481 Hs.55189 

25 120356 AF000545 Hs.296433 
120371 AA219305 Hs.104196 
120383 AL109963 Hs.123122 
120386 AW969665 Hs.154848 
120389 AW967985 Hs.325572 

30 120396 AA134006 • HS79306 
120418 AW966893 Hs.26613 
120472 AI950087 
120484 AA253170 Hs.96473 
120570 AA280679 Hs.271445 

35 120582 BE244830 Hs.284228 
120595 AA282074 .Hs.237323 
120624 AW407987 Hs.173518 
120695 AA976503 
120713 AW449855 Hs.96557 

40 120750 Ai191410 Hs.96693 
120774 AI608909 Hs.193985 
120807 AA346385 Hs.30002 
120809 AA346495 
120984 BE262951 Hs.99052 

45 121081 AA398721 Hs.186749 
121408 AA406137 Hs.98019 
121505 AA494172 Hs.194417 
121508 AA402515 Hs.97887 
121513 AA416653 Hs.181510 

50 121549 AA412477 Hs.98142 
121558 AA412497 
121655 AA421537 Hs.178072 
121744 AA398784 Hs.97514 
121748 BE536911 Hs.234545 

55 121773 AB033022 Hs.158554 
121832 AW340797 Hs.98434 
121839 AA425691 Hs.1 91606 
121882 AA426376 Hs.98459 
121911 AA427950 

60 121999 AA430211 Hs.98668 
122013 AA431085 Hs.98706 
122036 W92142 Hs^71963 
122356 AA443794 HS.9839Q 
122371 AA868555 Hs.178222 

65 122372 AA446008 Hs.336677 
122460 AW418788 Hs.99148 
122490 AA448349 Hs.238151 



hypothetical protein FU1 1 301 1 .5 

hypothetical protein FU10461 6.2 
hypothetical protein FLI23468 10.6 
cadherin, EGF LAG seven-pass G-type rece 6.8 
hypothetical protein FLJ20739 5.5 
KIAA1332 protein 9.8 
Homo sapiens mRNA full length insert cON 2.4 
Human done 23826 mRNA sequence 7.4 
SRY (sex determining region y)-box 4 2.1 
integrin-iinked Idnase-associated serine 2.7 
p104}tndbig protein 5.2 
butyrate-faiduced transcript 1 5.7 
ESTs 7.4 
fibronecfin 1 57 
ESTs 1.4 
ESTs 8.4 
nudeolar.protein NOP5/NOP58 67 
kallikrein 5 (KLK5; KLK-L2; stratum com 9.2 
Homo sapiens clone PP1498 unknown mRNA 457 
fibroblast growth factor 12B 38,9 
ESTs. WeaWyslmiiar to Z195.HUMAN ZINC 15.2 
ESTs 6.4 
hypothetical protein FLJ20285 16.1 
hypothetical protein 16.8 
putative purinengic receptor 28.1 
EST 12.4 
FSH primary response (LRPR1 , rat) homolo 9.7 
hypothetical protein DKFZp434D01 27 32.6 
ESTs, Moderately similar to ALU7_HUMAN A 21.7 
eukaryofic translation initiation factor 1 2.5 

Homo sapiens mRNA; cDNA DKFZp586F1323 (f 1 1 .4 
gbw<i05c02jc1 NCLCGAP_Kid12Homosaplen 19.4 
EST 10,4 
ESTs, Weakly similar to ALU1_HUMAN ALU S 14.4 
ZNF135-fike protein 10.2 
N-acetylgbcosamine-phosphate mutase 7.5 
M-phase phosphoprotein homolog 52 
gb:oq30a04.s1 NCLCGAP_GC4 Homo sapiens 46.8 
Homo sapiens cDNAFLJ12727fis, clone NT 5.9 
ESTs, Moderately similar to 2109260A Be 7 
ESTs . 7.8 
SH3-conlaining protein SH3GLB2; KIAA1848 6.8 
gb:EST52657 Fetal heart 11 Homo sapiens 4.4 
ESTs 5.6 
ESTs. H^hly similar to 137550 mismatch 5.4 
EST ' 6 

ESTs 13.1 
ESTs 28 
ESTs 6.2 
EST 7.4 
gb:zt95g12,s1 Soares_testis_NHT Homo sap 2.8 
Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 
ESTs 7.1 
hypothetical protein NUF2R 1 9.5 

KIAA1196 protein 7.9 
ESTs 5.8 
ESTs. Highly similarto KIAA1048 protein 5 
ESTs 5 
gb:zw50f02.s1 Soares^total_fetus.Nh2HF8_ 7.2 
EST 6.4 
ESTs 6.5 
ESTs, Weakly simBar to ALU5_HUMAN ALU S 13.1 
ESTs 7.3 
ESTs 5 
EST 7.6 
ESTs. Wea»y similar to S43569 R01H10.6 97 
. EST 6.1 
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122492 AA448417 Hs.104990 
122510 AA449232 Hs,99195 
122530 AW959741 Hs.40368 
122572 AA452601 Hs.99287 
5 122607 AA453518 Hs.98023 
122614 AA453630 Hs.99339 
122616 AA453638 Hs.161873 
122618 AA453641 
122622 AA453987 Hs.144802 
10 122717 AA456859 Hs.178358 
122829 AW204530 Hs.99500 
122838 AA460584 Hs.334386 
122856 AI929374 Hs.75367 
122868 AF005216 Hs.115541 
15 122907 AA470074 Hs.169896 
123016 AW338067 Hs.323231 
123034 AL359571 Hs.44054 
123136 AW451999 Hs.194024 
123152 AW601773 Hs.270259 
20 123394 AA731404 Hs.105510 
123466 AA599042 Hs.112503 
123486 BE019072 Hs.334802 
123615 AA609170 
123735 NM_013241Hs,95231 
25 123753 AA609955 Hs.234981 
124008 AI147155 Hs.270016 
124385 AI267847 
124440 AA532519 Hs.129043 
124656 AW297702 Hs.102915 
30 124683 AA381661 Hs.1 19878 
124735 R22952 Hs.268685 
124761 AA374756 Hs,93560 
124768 AW368528 Hs.100855 
124788 R43543 Hs.100912 
35 124811 R46068 Hs.288912 
124812 R47948 Hs.188732 
124822 AA418160 Hs.86043 
124860 R65763 Hs.101477 
124903 AW296713 Hs.221441 
40 124930 AI076343 Hs.173939 
124942 R99978 Hs.268892 
125051 T79956 Hs.100588 
125056 T81310 Hs.100592 
125101 A1472068 Hs.286236 
45 125115 T97341 

125280 A1123705 Hs.105932 
127274 AW966158 Hs.58582 
128528 R39234 Hs.251699 
128670 AA975486 Hs.103441 
50 - 128691 W27939 Hs.103834 
128772 BE302796 Hs.1050g7 
128781 N71826 Hs.105465 
128797 NWL002975HS.105927 ' 
128868 AA419008 Hs.106730 
55 128891 F34856 Hs.292457 
128946 Y13153 Hs.107318 
128975 BE560779 Hs.284233 
128995 AI816224 Hs.107747 
129019 A1950087 
60 129076 AW296805 Hs.326234 
129088 AA744610 Hs.194431 
129096 AA463189 Hs.288906 
129198 N57532 Hs.109315 
129347 BE614192 Hs.279869 
65 129362 U30246 Hs.1 10736 
129372 NM.016039HS.1 10803 
- 129404 A1267700 Hs.317584 



ESTs 5.4 
ESTs 11.2 
adaptor-related protein complex 1 , slgma 1 0. 1 
EST 11 
ESTs 61.5 
EST 10.7 
ESTs 107.3 
gb:zx48e06.s1 Scares Jestis.NHT Homo sap 31.1 
ESTs 5.6 
ESTs 8.5 
ESTs 81.8 
ESTs 75.3 
Src-like-adapter 5.8 
Janus kinase 2 (a protein tyrosine kinas 5.3 
ESTs 11.5 
Homo sapiens cDNA FU1 1946 fis. done HE 2.8 
ninein (GSK3B interacting protein) 8.7 
ESTs 5.1 
ESTs 5.2 
ESTs • 3.6 

EST 7.4 
Homo sapiens cDNAFU14680ris. clone NT 2.4 
gb:afi2a12.s1 Soares_testi.s_NHT Homo sap 7.8 
FH1/FH2 domain-containing protein 10 
Huntingtin interacting protein E 30.6 
ESTs . 8,1 

gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from ck)ne 989H1 1 on 7.8 
ESTs 8.3 
ESTs, Weakly similar to M3K9_HUMAN MITOG 7.9 
ESTs 11.3 
Homo sapiens mRNA for K1AA1771 protein, 9 
ESTs 8.1 
Homo sapiens cDNA: FU22726 fis, done H 5.1 
hypothetical protein FLJ22604 14.2 
ESTs 7.9 
Homo sapiens CDNAFU13558 fis. done PL 6.6 
EST 23.9 
ESTs 32.4 
ESTs, Weakly similar to ALUB_HUMAN 22.8 
ESTs, Moderately similar to B34087 hypot 6.1 
EST 135.3 
ESTs 5.4 
KIAA1856 protein 5.6 
gb;ye57e05.s1 Soares fetal liver spleen 9.6 
ESTs 8 
Homo sapiens cDNA FU12789 fis. ckme NT . 12.8 
ESTs. Weakly similar to ION4-GGTR14[H.s 2.8 
Homo sapiens/Similar to RIKEN cDNA 1700 7.1 
hypothetical protein MGC5576 7.7 
thymidine kinase 1 , soluble 5.3 
small nudear ribonudeoprotein polypept -53.9 
stem cell growth factor; lymphocyte seer 13.3 
chromosome 22 open reading frame 3 3 
Homo sapiens, done f\/lGC:16362. mRNA. com 1 3.3 
kynurenine 3-monooxygenase (kynurenine 3 7.2 
NlCE-5 protein 14 
DKFZP566C243 protein 1.9 
gb:wq05c02.x1 NCLCGAP_Kid12Homosapien 2.9 
ESTs. Highly similar to T46422 hypotheti 5 
palladin 17.1 
WW Domain-Containing Gene 20.9 
K1AA1415 protein 5.8 
melanoma-associated antigen recognised b 7.6 
solute carrier family 1 2 (sodium/potassi 6.7 
CGi-99 protein 2 
ESTs 5 
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Hs.12152 
Hs. 142838 
Hs.14845 



Hs.15589 

Hs.155956 

Hs.183706 

Hs.295112 

Hs.1584 

Hs,179825 

Hs,252587 

Hs.15977 

Hs.1608 

Hs.16178 

Hs.1674 

Hs.17639 

Hs.17962 

Hs.18747 



129482 AA188185 Hs.289043 
129559 W01296 Hs.11360 
129587 H14718 Ks.11506 
129629 AK000398 Hs.11747 
5 129649 A0000092 Hs.16488 
129680 U03749 
129889 AW748482 Hs.77873 
129702 A1304966 Hs.12035 
129720 AA1 56214 
10 130010 AA301116 
130097 AL046962 
130135 AA311426 Hs.21635 
130211 NM_003358Hs.23703 
130242 X79201 Hs.153221 
15 130359 NM_013449Hs.277401 
130365 W56119 Hs.155103 
130448 BE513202 
130455 D90041 
130471 AL121438 
20 130503 BE208491 
130511 L32137 
130542 U64675 
130553 AF062649 
130556 AI907018 
25 130567 AA383092 
130574 AF083208 
130617 M90516 
130657 BE246961 
130693 R68537 
30 130744 H59596 

130757 AL036067 Hs.18925 
130880 BE514434 Hs.20830 
130944 BE382657 Hs.21486 
131046 AA321649 Hs.2248 
35 131060 AA194422 Hs.22564 
131099 AL133353 Hs.226581 
131135 NM_016569Hs.267182 
131185 BE280074 Hs.23960 
131225 H62087 
40 131245 AL080080 
131283 X80038 
131569 AL389951 
131643 AW410601 Hs.30026 
131714 AA642831 Hs.31016 
45 131722 013757 Hs.311 
131760 X76732 Hs.3164 
131793 AW966127 Hs.32246 
131885 BE502341 Hs.3402 
131900 AA099014 Hs.231029 
50 131905 AA179298 Hs.3439 
131941 BE252983 Hs.35086 
131971 BE567100 Hs.154938 
132180 NM.004460Hs.418 
132203 NM_004782Hs.194714 
55 132273 AA227710 Hs.43658 
132288 N36110 Hs.305971 
132294 AB023191 Hs.44131 
132348 AW067708 Hs.170311 
132370 AW572805 Hs.46645 ' 
60 132384 AA312135 Hs.46967 
132450 AA100012 Hs.48827 
132465 AW169847 Hs.49169 
132532 AA454132 Hs.5080 
132574 AW631437 Hs.5184 
65 132638 A1796870 Hs.54277 
132718 NM,004600Hs.554 . 
132726 N52298 Hs.55608 



Hs.31659 
Hs.24766 
Hs.339713 
Hs.271623 



spindlin 6.7 

hypothetical protein FU14784 7.5 

Human done 23589 mRl^lA sequence 6.8 

hypothetical protein FU20391 3.8 

cafreticulin 3.3 
gb:Human chromogranin A (CHGA) gene, pro 14.1 

B7homoiog3 2.6 . 

ESTs, Weakly similar to 138022 hypotheti 7.4 

APMCF1 protein 2 

nudeoiar phosphoprotein Nopp34 1 .6 

forWieadt)ox03A 2.8 

tubulin, gamma 1 6.1 
ESTs, IWodeiately similar to CEGT_HUMAN CI. 6 

synovial sarcoma, translocated to X chro 5.4 

bromodomain adjacent to zinc finger doma 8.5 

eukaryotic translation initiation factor 1 1 

PPAR binding protein 3.9 

N-acetyltransferase 1 (arylamine N-acety 33.6 

addudn 1 (alpha) 2.7 

K1AA0618 gene product 16.1 

cartilage oligomeric matrix protein (pse 6.1 

RAN binding protein 2-(il<e 1 7.8 

pituitary lumor-transfonning 1 14.4 

Empirically selected from AFFX single pr 4.7 

repncation protein A3 (14kD) 7.9 

apoptosrs antagonizing transcripSon fac 1 .2 

glutamine-fructose«6-phosphatetransamin 12.1 

Homo sapiens ubiquitin protein ligase (U 1 3.9 

ESTs . 2 

P0P7 {processing of precursor, S, cerevi 3.1 

protein X 0001 5.7 

kinesin^ilce2 2.1 

signal transducer and activator of trans 5.4 

small indudble cytokine subfamily B (Cy 7.4 

myosin VI 5.1 

C0X15 (yeast) homolog, cytochrome c oxid 7 
TBX3-iso protein 3.3 
cydinBI 5.8 
thyroid honnone receptor-associated prot 7.5 
thioredoxin domain-containing 2.8 
Homo sapiens done F19374 APO E-C2 gene 1 .3 
nucleoporin 50kD 5 
HSPC182 protein 2.9 
putative DNA binding protein 2.9 
phosphoribosyl pyrophosphate amidotransf 3^4 . 
nudeoblndin 2 2.9 
Homo sapiens cDNA FU 14656 fis, done NT 7.9 
ESTs . 137 
Homo sapiens, done MGC:15961, mRNA, com 8.7 
stomatln-IIke 2 11.3 
ubiquitin spedfic protease 1 2.3 
hypothetical protein MDS025 3.5 
fibroblast activation protein, alpha 14.7 
synaptosoma^^ssodated protein. 2dkD 7.8 
DKFZP586L151 protein 10 
solute carrier family 2.(fadlitated ghj 9.2 
K1AA0974 protein 2 
heterogeneous nudear ribonudeoprotein 125 
ESTs 28.3 
HSPC034 protein 6.1 
hypothetical protein FU12085 8.6 
KIAA1634protan 6.1 
mitochondrial ribosoma) protein L16 7.1 
TH1 drosophite homolog 14 
DNA segment on chromosome X (unique) 99212.4 
Sjogren syndrome antigen A2 (60kD, ribon 37 
hypothetical protein ^/lGC955 14.3 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



132731 
132744 
132773 
132798 
132810 
132833 
132842 
132851 
132891 
132941 
132972 
132980 
132994 
133016 
133177 
133208 
133254 
133286 
133268 
133285 
133390 
133391 
133540 
133594 
133621 
133720 
133760 
133784 
133791 
133797 
133822 
133850 
133865 
133881 
133924 
133959 
133989 
133997 
134234 
134348 
134376 
134379 
134405 
134421 
134480 
134516 
134529 
134751 
134790 
134806 
134850 
134859 
134971 
135181 
135207 
135245 
135257 
135307 
135321 
135354 
135400 
302276 
317781 
321114 
322556 
420802 
424001 



AI189075 Hs.301872 

AA010233 Hs.55921 

AA459713 Hs.295901 

AI026701 Hs.5716 

AB007944 Hs.5737 

U78525 Hs.57783 
NM_016154Hs.27977l 

U09716 Hs.287912 

BE267143 Hs.59271 

A1817165 Hs.6120 

AA034365 Hs.288924 

AA04Q696 Hs.62016 

AA1 12748 Hs.279905 



A)439688 

X97795 

AI801777 

AI567421 

AI160873 



Hs.6289 

Hs.66718 

Hs.6774 

Hs.273330 

Hs.69233 



AW956781 Hs.293937 
M76477 Hs,289082 
A1950382 Hs.72660 
AW103364 Hs.727 
AU)37159 Hs.74619 
AW160781 Hs.172589 
NMj004893Hs.75258 
L27841 Hs.75737 
BE271766 Hs.181357 
BE622743 Hs;301064 



M34338 
AL133921 
D50525 
W29092 



Hs.76244 

Hs.76272 

Hs.699 

Hs.7678 



AB011155 Hs.170290 
U30872 Hs.77204 



D86326 
X81789 



Hs.325948 
Hs.77897 



AL040328 Hs78202 
AI824113 HS78281 
BE300078 Hs.80449 
AW291946 Hs.82065 
X06560 Hs.82395 
AW362124 Hs.323193 
AW067903 Hs.82772 
AU077196 Hs,82985 
NM_005000Hs.83916 
AK001571 Hs.273357 
AW411479 Hs.848 
AW530803 Hs.89497 
BE002798 Hs.287850 
AD00152B Hs.89718 
AI701162 Hs.90207 
D26488 Hs.90315 
A1097346 Hs.286049 
BE250865 Hs.279529 
N26427 Hs.9634 
AI028767 Hs.262603 
AW291023 Hs.97255 
AI743770 Hs,98368 
AI652069 Hs.98614 
AA456454 Hs.l 83418 
X78592 Hs.99915 
AW057736 Hs,323910 
NM_007057Hs.42650 
AA902256 Hs78979 
BE041451 Hs.177507 
U22376 Hs.1334 
W67883 Hs.l 37476 



hypothetical protein MGC4840 5.9 

glutamyf-pro^l-fRNA synthetase 6.4 

KiAA0493 protein 14.6 

K1AA0310 gene product 2.5 

KIAA0475 gene product 4,2 

eukaryotic translation initiation factor 6.1 
Homo sapiens done PP1596 unknown mRNA7.1 

lectin, mannose-binding. 1 6.1 

U2(RNU2) small nuclear RNA auxiliary fac 27 

hypothetical protein FLJ13222 2.1 

Homo sapiens cDNA FUl 1392 fis, clone HE 3.5 

ESTs .1.3 

clone HQ0310 PRO0310p1 17.1 

hypothetical protein FLJ20886 4.4 

RAD54 (S.cerevisiae)-I(ke 4.4 

ESTs 5.5 
Homo sapiens, clone IMAGE:3544662, mRNA. 1 .3 

zinc finger protein 16.1 
ESTs, Weakly similar to FXD2^HUMAN FORKH 12.2 

GM2 ganglioside activator protein 10.4 

phosphatidylserine receptor 5.7 

inhibin, beta A (adhrin A. activin A6 a 25.5 

proteasome (prosome, macropain) 26S subu 1.7 

nuclear phosphoprotein similar to S. cer 2.6 

H2A histone family, member Y 13.5 

pericentriolar material 1 6.7 

laminin receptor 1 (67kD. ribosomal pro! 5.4 

arfaptin 1 12.1 

spemnldine synthase 9.7 

retinoblasloma-binding protein 2 1.3 

peptidylprolyl isomerase B (cyclophilin 9.7 

cellular retinoic acid-binding protein 1 4.2 

discs, large (Drosophila) homolog 5 5 

centromere protein F (350M00kD. mitosin 9.1 
vesicle docking protein p115 1.8 

splicing factor 3a, subunit 3, 60kO 10.4 

SWI/SNF related, matrix assodated, ac« 2,6 

regulator of G-proteIn signalling 12 13 
Homo sapiens, clone IMAGE:3535294, mRNA, 10.3 

interieukin 6 signal transducer {gpl 30, 6.7 

2',5*-oligoadenylate synthetase 1 (40-46 5.5 

hypothetical protein MGC3222 5.8 

collagen, type XI, alpha 1 72.9 

collagen, type V, alpha 2 6.7 

Empirically selected from AFFX single pr 6.2 
hypothetical protein FU10709 1.4 
FK506>binding protein 4 (59kO) 2.8 
taminBI 6.1 
integral membrane protein 1 1.2 
spemnine synthase 2.6 
hypothetical protein MGC1 1 1 38 9.1 
KIAA0007 protein • 13.3 
phosphoserine aminotransferase 2 
px19-Iike protein 14.9 
ESTs. Highly similar to C10_HUMAN PUTAT1 1 .7 
ESTs 12.2 
ESTs. Weakly similar to A46010 X-linked 7.6 
ESTs. Weakly similar to KIAA0822 protein 5.8 
ribosome binding protein 1 (dog 180kD ho 12.3 
cell division cycle 2-like 1 (PITSLRE pr 57 
androgen receptor (dihydrotestoslerone r 1 3.9 
HER2 receptor tyrosine kinase (c 5.3 
ZWlOinterador 2.8 
Golgi apparatus protein 1 5.5 
hypothetical protein 2.9 
v-myb avian myeloblastosis viral oncogen 2.3 
paternally expressed 10 (PEG10; K1AA105 7 
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' 425182 AF041259 . Hs.155040 zinc finger protein 217 2.3 

446999 AA151520 Hs.334822 hypothetical protein MGC4485 7.5 

450701 H39960 Hs.288467 Homo sapiens cDNAFU12280fis. done MA 5.6 ' 

452461 N78223 Hs.108106 transcription factor 4.7 

5 453157 AF077036 Hs.31989 DKFZP586G1 722 protein 12.1 
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TABLE 5A 



Table 5 A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
5 designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

10 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



IS' Pkey CAT number Accessions 

123615 30686_-15 AA609170 
124385 656394_1 A1267847 N27351 

110856 19346_14 AA992380 N33063 N21418 H79958 R21911 H79957 

20 120472 44573_2 A1950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 

AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 Ai950344 AI741346 AI689062 AA282915 AW102898 AI872193 A1763273 AW173586 
AW150329 A1653832 AI762688 AA988777 AA488892 AI356394 AW103813 A1539542 AA642789 AA856975 AW505512 

25 AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 

AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA959759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW61 3456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 A1002839 R67840 AA300207 AW959581 T63226 
F04005 

30 129019 44573 2 . Ai950087 N70208R97040 N36809 AI308119 AW967677 N35320AI251473 H59397 AW971573 R97278 W01059 AW967671 

AA9b8598 AA251875 A1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 A1741346 A1689062 AA282915 AW102898 A1872193 A1763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI3563g4 AW103813 AI539642 AA642789 AA856g75 AW505512 

35 AI961530 AW629970BE612861 AW276g97 AW513601 AW512843 AA044209 AW856538AA180009 AA337499 AW961101 

AA251669 AA251874 AI819225 AW205a62 AI683338 AI858509 AW276905 Ai633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R86056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

40 120695 9683^3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 

AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 
122618 305217.1 AA453641 AA454061 
125115 genbank_T97341 T97341 
120809 . genbank_AA346495 AA346495 
• 45 129680 23162 1 U03749 NrvL001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 A1560122 

AI858999 D55958 AI684005 D53170 AA854091 A1025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 A1969102 AA405741 A1091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 A1686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 
50 AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 

AA978699 AI687001 AI621 107 Ar865540 AA772107 C06286 AA319661 AA405992 
101045 entrezJ05614J05614 
110501 genbank^H55748 H55748 
121558 genbanK.AA412497 AA412497 
55 121911 genbank_AA427950 AA427950 
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TABLE 6: Figure 6 from BRCA 001 US 
Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbanic accession number 

UnigenelO: Unigene number 

10 Unigene TlUe: Unigene gene title 

R1 : Ratio of tumor to nomnal breast tissue 



Pkey ExAccn UnlgenelD UnigeneTitle R1 



100147 D13666 Hs.136348 osteoblast specific factor 2 (fasdcfin 7.5 

100678 AW502935 Hs.740 PTK2 protein tyrosine Idnase 2 53^ 

101806 AA586894 Hs.112408 S 100 calcium-binding protein A7 (psorias 8.9 

102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

20 103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 

105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6 

105746 AW1 51952 Hs.46679 hypothetical protein FU20739 1.5 

106373 AW503807 Hs.21907 histone acetyltransferase 1.8 
110240 AI668594 Hs.l76588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 

25 119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 

120206 H26735 Hs.91668 Homo sapiens done PP1498 unknown mRNA 45.7 

120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 

120297 AA191384 Hs.104072 ESTs. Weakly similar to Z195_HUMAN ZINC 15.2 

120624 AW407987 Hs.173518 M-phase phosphoprofein hpmotog 52 
30 120695 AA976503 gb:oq30a04.s1 NCLCGAPjGC4 Homo sapiens 46.8 

120807 AA346385 Hs.30002 SH3-containing proteb SH3GLB2; KIAA1848 6.8 

121508 AA402515 Hs.97m ESTs 28 

122607 AA453518 Hs.98023 ESTs 61.5 

122616 AA453638 Hs.161873 ESTs 107.3 

35 122618 AA453641 gb:zx48e06.s1 Soares_lestis_NHT Homo sap 31.1 

122829 AW204530 Hs.995O0 ESTs B1.8 

122838 AA460584 Hs.334386 ESTs 75.3 

123753 AA609955 Hs.234961 Huntingtin interacting protein E ^ 30.6 

124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1 

40 124860 R65763 Hs.101477 EST 23.9 

124930 AI076343 Hs.1 73939 ESTs. Weakly similar to ALUB^HUMAN IH! 22.8 

125051 T79956 Hs.100588 EST 135.3 

128781 N71826 Hs.105465 small nudear ribonucleoprotein pblypept 53.9 

129096 AA463189 Hs.288906 WW Domain-Containing Gene . 20.9 

45 129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 

129689 AW748482 Hs,77873 B7 homolog3 2.6 

130503 BE208491 Hs.295112 KIAA061 8 gene product 16.1 

130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1 

131046 AA321649 Hs.2248 small indudble cytokine subfemiiy B (Cy 7.4 

,50 131643 AW410601 Hs.30026 HSPC182 protein 2.9 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7 

V 132180 NM_004460Hs.418 fibroblast activatran protem. alpha 14.7- 

132370 AW572805 Hs.46645 ESTs 28.3 

132994 AA1 12748 Hs.279905 clone HQ0310 PRO0310p1 17;l 

55 133016 AM39688 Hs.6289 hypothetical protein FU2D886 4.4 

133266 AI160873' Hs.69233 zinc finger protein 16.1 

133391 AW103364 Hs.727 inhibin, beta A (acfivin A, ac&nn ABa 25.5 

134169 AI8g0916 Hs.178137 transducer of ERBB2. 1 .12 

134219 NM 000402H5.80206 glucose-6-phosphate dehydrogenase 1-9 

60 134405 AW087903 Hs.82772 collagen, type XI. alpha 1 72.9 

134529 AW41 1479 Hs.848 FK506.binding protein 4 (59kD) 2.8 

134975 R50333 Hs.92186 Leman cofled-coil protein 2.6 

135181 BE250865 Hs.279529 px19.like protein 145 

322555 BE041451 Hs.177507 hypothetical protein 2.9 
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TABLE 6A 



Table 6 A shows the accession numbers for those pkeys lacking unigeneED's for Table 6. For 
5 each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 
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CAT number. 
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Unique Eos probeset Identifier nunnber 
Gene duster number 
Genbank accession numbers 
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Pkey 



CAT number Accessions 



124385 
120695 



656394.1 
9B83J 



AI267847 N27351 

AA976503 AI917802 AA953664 AA404613AA428771 BE280542 AW194691 AI927301 AI740458 Ai796100AI935603 
AW052210 AA970201 A16333B4 AA425910 AI017004 AI241295 AA402816 AA291468 
AA453641 AA454061 



20 



122618 



305217_1 
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TABLE 7: Figure 7 from BRC A 001-1 US 

Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 
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Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unlgene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to nomial breast tissue 

ORF struct info: Structural characterization of open reading frame for ttie sequence of tiie gene 

Pkey ExAccn UnigenelD UnigeneTitle 

100113 NM_001269 Hs.84746 chromosome condensation 1 

100114 X02308 Hs.82962 thymidylate synthetase 
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 

100146 BE185499 Hs.2471 KIAA0020 gene product 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasdclin 
100154 H60720 Hs.81892 KIAA0101 gene product 
100163 W44671 Hs.124 gene predicted from cDNA with a complete 
100220 AW015534 Hs.217493 annexinA2 
100265 D38521 Hs.112396 KIAA0077 protein 
100271 BE160081 Hs.256290 S100 calckinvbinding protein All (caigiz 
100275 BE242802 Hs.154797 KIAA0090 protein 
100323 D50920 Hs.23106 KIAA0130 gene product 
100335 AW247529 Hs.6793 platetet-aclivatihg factor acetylhydrola 
100364 NM„004341 Hs.1 54868 carbamoyl-phosphate synthetase 2, aspart 
100372 NMJ)14791 Hs.184339 KIAA01 75 gene product 
100393 D84145 Hs.39913 novel RGD-contatning protein 
100400 AW954324 Hs75790 phosphatidylinositol glycan. dass C 
100418 D86978 .Hs.84790 KIAA0225 protein 
100482 M65028 Hs.81361 heterogeneous nuclear ribonudeoprotein 
100518 NM^004415 Hs.74316 ■ desmoplaWn {DPI. DPII) 

100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 

100668 L05424 Hs.169610 CD44 antigen (horrang function and Indian 
100678 AW502935 Hs.740 . PTK2 protein tyrosine kinase ^ 
100783 AF078847 Hs.191356 general transcription factor I IH, polype 
100892 BE245294 Hs.180789 S164 protein 
100945 AF002225 Hs.180686 ubiquitin protein Figase E3A (human papi 
100969 AA157634 Hs.79172 solute earner famHy 25 (mitochondrial 
100988 AK0004O5 Hs.76480 ubtquitin-like 4 

100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 
101031 J05070 Hs,151738 matrix metalloproteinase 9 (geJatinase B 

101045 J05614 gb:Human proliferating cell nuclear anti 
101077 N99692 Hs.75227 Empirically selected from AFFX single pr 
101093 106419 Hs.75093 procoltagen-lysine. 2-oxoglutarate S^lio 
101186 AA020956 Hs.l79881 core-binding factor, beta subunit 
101216 AA284166 Hs.84113 cydin-dependent kinase inhibitor 3 (CDK 
101228 AA3333B7 Hs.82916 chaperoninooniainingTCPl. subunit 8A( 
101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 
101249 L18964 Hs.1904 protein kinase C, iota 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 
101352 AI494299 Hs.16297 C0X17 (yeast) homotog. cytochrome c oxid 
101396 BE267931 Hs.78996 proliferating ceil nuclear antigen 
101445 M21259 gb:Human Alu repeats in the region 5' to 

101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 
101478 Nl\/L002890 Hs.758 RASp21 protein activator (iSTPaseactiva 
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101483 M24486 
101540 J04977 
101573 AW248421 
101580 NM.012151 
5 101592 AF064853 
101621 BE3918G4 
101702 AW504089 
\ 101734 M74099 
101759 M80244 
10 101767 M81057 
101782 AA306495 

101805 AW409747 

101806 AA586894 
101810 NM_000318 

15 101879 AA176374 
101911 AA441787 
101920 AF182645 
101973 U41514 
102009 BE245149 

20 102036 BE250127 
102083 T35901 
102107 BE258602 
102123 NM_001809 
102165 BE313280 

25 102198 AW950852 
102217 AA829978 
102220 U24389 
102234 AW163390 
102260 AL039104 

30 102302 AA306342 
102330 BE298063 
102339 BE378432 

102348 U37519 

102349 AU077055 
35 102369 U39840 

102374 U33635 
102391 AA296874 
102455 U48705 
102465 NM_001359 

40 102488 U50939 
102489 AL080116 
102494 AI188137 
102501 AF217197 
102522 BE250944 

45 102532 AF040i253 
102564 U59423 
102568 W81489 

102580 U60808 

102581 AU077228 
50 102582 U61232 

102617 AW161453 

102618 ALJ037672 
102627 AL021918 
102663 NML002270 

55 102676 BE262989 
102687 NKA.007019 
102689 U96132 
102696 BE540274 
102704 AU077058 

60 102705 T97490 

102750 AB014460 
102801 BE252241 
102812 U90549 
102827 BE244588 

65 102844 AV653790 
102868 X02419 
102925 BE440142 



Hs.76768 p/ooonagen-prorine, 2-Qxog(utarate 4-di 

Hs.64981 X-iay repair complementing defective rep 

Hs.250758 proteasome (prosome, macropain) 26S subu 

Hs.83363 coagulation fader Vlll-associated (intr 

Hs.91299 guanine nucleotide binding protein (G pr 

Hs.62661 guanylate binding protein 1. interferon- 

Hs.179574 protein phosphatase 2 (fonneriy 2A), reg 

Hs.147049 cut (DrosophilaHike 1 (CCAAT dispiacem 

Hs. 1 84601 solute carrier family 7 (cationic amino 

Hs. 1 80884 carboxypeptidase B 1 (tissue) 

Hs.1869 phosphoglucomutase 1 

HsJ5612 stress-induced-phosphoprotein 1 (Hsp70/H 

Hs. 1 12408 S1 00 calcium-binding protein A7 (psorias 

Hs, 18051 2 peroxisomal membrane protein 3 (35kD. Ze 

Hs.243886 nuclear autoantigenic spenn protein (his 

Hs.1 19689 glycoprotein hormones, alpha polypeptide 

Hs.8024 IK cytokine, down-regulator of HLA 11 

Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po1yp 

Hs.82643 protein tyrosine kinase 9 

Hs.82906 CDC20 (cell division cyde 20, S. cerevi 

H s J5 1 1 7 interteukin enhancer binding factor 2. 4 

Hs. 1 82366 heat shock protein 75 

Hs.1 594 centromere protein A (1 7kD) 

Hs.1 59627 death associated protein 3 

Hs.74598 polymerase (DNA directed), delta 2. regu 

Hs.301613 JTV1 gene 

Hs.65436 lysosomai 

Hs.278554 heterochromatin-like protein 1 

Hs.159557 karyopherin alpha 2 (RAG cohort 1 . impor 

Hs.69171 protein kinase C-like 2 

Hs.77254 chromobox homolog 1 (Drosophila HP1 beta 

Hs.95577 cyclin-dependent kinase 4 

Hs.87539 aldehyde dehydrogenase 3 family, member 

Hs.289107 baculoviral lAP repeat-containing 2 

Hs.299867 hepatocyte nuclear factor 3, alpha 

Hs.90572 PTK7 protein tyrosine kinase 7 

Hs.77494 deoxyguanosine kinase 

Hs75552 discoidin domain receptor family, member 

Hs.81548 2,4-dienoyl CoA reductase 1, mitochondri 

Hs.61828 amytoid beta precursor protein-binding p 

Hs74420 origin recognition complex, subunit 3 (y 
Hs J5193 C0P9 homolog 
Hs.74562 siah binding protein 1 ; FBP interacting 

Hs. 1 83556 solute carrier family 1 (neutral amino a 

Hs.70186 suppressor of Ty (S.cerevisiae) 5 homolo 
Hs.79067 MAD (motiiers against decapentaplegic. Dr 
Hs.223025 RAB31 , member RAS oncogene family 
Hs.1 52981 CDP-diacylglycerol synthase (phosphatida 
Hs77256 enhancer of zeste (Drosophila) homolog 2 
Hs.32675 tubulin>speciric diaperone e 
Hs.198767 C0P9 (constitutive photomorphogenic. Ara 
Hs.81071 extraceilular matrix protein 1 
Hs.158174 zinc finger protein 184 (Kmppel-like) 
Hs.168075 karyopherin (imporfin) beta 2 
Hs.1 2045 putative protein 
' Hs.93002 ubiquitin earner protein E2-C 
Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 
Hs.239 forichead box Ml 
Hs,54089 BRCA1 associated RING domain 1 
Hs.50002 small inducible cytokine subfamily A (Cy 
Hs.66196 nUi (E.coli endonuclease lll)-Cke 1 
Hs.38041 pyridoxal {pyfadoxme, vitamin B6) kinas 
Hs.236774 high-mobility g roup (nonhistone chromoso 
Hs,6456 chaperonin contaNng TCP1 , subunit 2 (b 
Hs.324275 WW domain-containing protein 1 
Hs.77274 plasminogen activator, urokinase 
Hs.2943 signal recognition particle 19kO 
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1.6 


other 


m'l'iQi AA^mn'^Q 


He 900"^ 


nrntein nhnenhataep A /fAimpriv X\ f^tA 


2.5 


other 


lUoiyo iNiVLuu**fDO 


He 7*^794 
fib. / 3/ 




2.2 


TM 




He 7A<^An 
nb./ 030U 


nPAn/H /AeA.r^liuAla.AeA/Hie) Hay nnlvrtpn 


6.3 


TM 




He 77'^R7 
no,/ f Out 


mnnAlcinP InHi irpH hv nAmma intprf^mn 


8.8 


TM 




We 17nA 

ns. 1 /uo 




3 


other 




He A 9Q7An 

ns. 1 zy / ou 


iiimnr nppiY^eie faWAr rPPontAT eiinpffsmi 


1.8 


other 


10j2oo AlJbyzoo 


ns.f 3 loy 


u63in-aSsociai6u proicin 




TM 




ns.yu/o 


llTilTlalUrc UUiOii UdlLuiUiTia uaflbi^ipi 1 


1 Q 


? 




Wc 774QR 

ns.f /*iyD 


email n t t^oo r rihtoni i/'^IPAnrAf pin AAl\/npnt 

bludii iiuuicdi iiLiuiiutHcupiuiciii \t\Jijy\syi 


2.5 


other 


lUJoAy Xoyuoy 




/iK<U eonipne mOKIA if\r iint/nnMin AfAiQtn OV 

yu.n.bopicrio iiirMHM lui UiDviiuwii piuicui ca 


1.6 


other 


10Jo7b ALOJblDD 


nS.3/3o/o 


COdicu vcstCic rnciiiDidiie piuiciii 


1 R 


other 


103391 X94453 


nS.n4obb 


pyrroItnB'S-csrboxylste synthstssG (9^t 


9 1 


other 


1 033yz Xy4t3bJ 




QD.n. sapiens □Di/acDp Qcne exon i « z. 


' A 


TM 


103430 Dt5D40idU 


nS./0/ lb 


uaiisiocdse Oi inner niaocnonunai nicmur 


1 '\- 


other 


103491 Arzb4/oU 


nS./o03/l 


myeiOtQ/iynipnoio or rniAcu-'iincagc icUKem 


R 7 


7 


1 U J3u3 ALUd 1 i£z*t 


Ue "in 09 


traner*rintinn farlAr AP-9 hofa /arfh/afi 
uaiiSwfipuun lowiui f\r~C ucid \auuvaii 


5.1 


ottier 


10334/ MoJo/dZ 


nS.lOuUDZ 


pruicaSOiTic \pr(»uiiie, niacrupaiii/ auuuiui, 


. Q7 


? 


1 039OO NMJUUO^l o 


nS.o3/Ui 


pnospnoinosiuuc-o~Kinase, caiaiyuc, ai 


0 

c 


other 


103oio NM_UUU34o 


Ue O'iiA 

nS./3iD 


CDV /eav Hatarmininn fttninn Vl_hAV Q tt*Si 

oKT (sex ueiBiniining legion i )-uua a \ua 


1 3 


? 


103621 Dc3f97ob 


Lie ^tifSfrjtl 

rtS. i3uo/9 


nnttfmaraea /DKIA\ II /^MA rlim/*taH\ HAlun 

poiyrnciase ^kinm^ ii ^uima □iiccicu/ puiyp 


O 

£. 


other 


10oO£^f AAouyood 


nS./f 00// 


mamKrano rv\mnAnonf ^KmnriACmnp 11 eiirfa 
iHCiTiDians vOriipuilclilt UMUilluamiic 1 1, auiia 


2.3 


TM 


103/ 2r Alo/oooo 


ns./yoooi 


nmitrfh f^nkriT raoanf Ar.h/M ifiH nmfotn 9 

yruWui laCiur rcCcpiui'UDUiiu piuiGiii ^ 


1.3 


other 


'I ni7RA A im f;7fio 

luo/o** Miuio/uy 


Ue i79nAQ 

ns. i//uoy 


HnmA eanipne mRMA' rHKIA nKF7nSAfil9n99 (i 
nUillU bdpicilb lllrMNM, L>L./|Nr\ urv ^puOOi^vf^i^ ^i 


1.3 


ottier 


■iivi7nn AAnQ47^o 


Uq 1AQQQ9 




7.6 


? 


1037yb n^bool 


Ue 7'Mt7 

nS./oo/ 


nOlTlU bapicnS O 1 D UUiMain piUlcUl \OUri-f III 


1.3 


SS.TM 


103/9/ AAUoU91<i 




nk'Tnn^Hn') ri Qtratanpna hNIT nPilfAn /Q'%7 

Qu.znUnuUO.i 1 ouaioycnc ihn i iicuiun yjoi 


1.6 


other 


103£)13 AIU4z:OOZ 


nS.io I// i 


CCi\ i 9n nrAtntn 

uoi-i/u proieiii. 


1.6 


other 


103033 WUZODJ 


Uo '?n99R7 

ns.ou//o/ 


nypuincucai proicui r iuoou 


1.6 


other 


1 U JOOO AfNliU I Z f 0 


ns. IU3/Or 


huDAthofir^l nrAfpin FH .tlflillR similar tn 

iiypuuicUv/ai piuicui IU*t lu ailllllOl lU 


6.6 


TM 


10403/ InIvI_UUZ*4U/ 


Ue Q7R44 


mdmm39tobin 2 


2.9 


otiier 


'iniin7Q AAO^ilO^Q 


nb. luozoo 


PQTe 
CO IS 


1^4 


other 


1041/ *♦ MM**/oyo** 


Ue Ril<\'l 


DPODRf^Q nrAfpin 
rrvVJUOSa piUlcill 


5.6 


TM 




Ue QAQ'^A 

ns.yoyjo 


prOluCaUJlclUI dipt la s 


1.6 


other 


lU4/fO Au3iy/U 


Wc ininR7 
ns. lu luo/ 


/2/^M*» /npnpra) /vintrol Af amino-apiH evnf 
ov/[\3 ^ycncidi uuiiuui ui diiiuiu~duiu aym 


5.4 


other 


1043/3 DCo/y/DD 


Uo iKnR7*\ 
ns. i3U0/3 


puiymciaSc ^rxINrV^ 1 1 \I-'INM uiicwiusu^ ynjiyy 


6.4 


other 


4AJI070 AA0Dy!KQ7 

1043/0 AA3z439i 


Ue OIRRi 

nS./loOl 


Unmn e^niane rHKIA PI M 9Qnn Fie tIaao MT 

nuino Sapiens ui^I'Im ruj i^iauu iiai uuiic in i 




other 


1044/J K(5on3 


Uc 14*^9 


nmfajn Irinaep C eiihetratp AflkC-H 
proXcUl Kiriabc w aiiuaudSB uwinti 


5.2 


other 


1044(5/ Ad03/ /^b/ 


Up /IXORQ 

nS.44/bo 


myGlin Q^no exprGssion tector 2 ^ 


1 9 


nthpr 

WU ICi 




He 30098 


ESTs 


1.4 


other 


104757 AI694413 


Hs.332649 


olfactoiy receptor, family 2. subfamily 


2.4 


other 


104804 AI858702 


Hs.31803 


ESTs. Weakly simyar to N-WASP [H.sapien- 


1.4 


other 


104806 AB023175 


Hs.22982 


KIAA0958protdn 


2.4 


other 


104827 AW052006 


H5.8551 


PRP4/STK/WD spUdng factor 


10.9 


other 


104846 Ai250789 


Hs.32478 


ESTs 


5.7 


other 


104854 AA041276 


Hs.154729 


3-phosphoinositide dependent protein kin 


12.3 


? 


104867 AA278898 


Hs.225979 


hypofheticai protein similar to small G 


2.1 


other 


104871 T78044 


Hs.28893 


Homo sapiens mRNA; cDN A 0KFZp564O2364 (( 


1.4 


other 


104896 AW015318 


Hs.23165 


ESTs 


17.7 


other 


104909 AW408164 


Hs.249184 


transcription factor 19 (SCI) 


5.1 


TM 


104916 AW958157 


Hs.1 55489 


NSI-assodated protein 1 


1.8 


other 


104919 AA026880 


Hs,25252 


prolactin receptor 


1.5 


other 


104930 AF043467 


Hs.32893 


neurexophilin 2 


Z3 


other 


104973 NM^015310 


Hs.6763 


KIAA0942 protein 


5.1 


other 
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104974 


Y12059 


Hs.278675 


bromodomaln-containing 4 


1,5 


other 


104975 AL136877 


Hs.50758 


SMC4 (stmctural maintenance of chromoso 


2.4 


other 


104978 


AI199268 


Hs.19322 


Homo sapiens. Similar to RIKEN cONA 2010 


7.3 


other 


104979 


AA937934 


Hs.321062 


ESTs 


1.3 


other 


104994 


AI499930 


Hs.334885 


mitochondrial GTP binding protein 


3.6 


? 


105009 


BE379584 


Hs,34789 


doiichyl-diphosphooIigosaccharide-pFotei 


5.6 


other 


105012 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


3.4 


other 


105028 


AI050715 


Hs.2331 


E2F transcription factor 5. p130^inding 


2.2 


other 


105041 


AB037716 


Hs.26204 


KiAA1295 protein 


2.2 


other 


105045 


BE242899 


Hs.129951 


speckle-type POZ protein 


3.9 


? 


105079 


AA151342 


Hs.12677 


CGI-147 protein 


9.5 


TM 


105087 


AA147884 


Hs.9812 


Homo sapiens cDNA FU 14388 fis, done HE 


5.7 


other 


105088 


H58589 


Hs.35156 


Homo sapiens cDNA FU1 1027 fis, clone PL 


2.2 


other 


105095 


Z78407 


Hs.27023 


vesicle transport-related protein 


2.2 


other 


105110 


BE387350 


Hs.33122 


KIAA1 160 protein 


1.6 


other 


105126 


AW975433 


Hs.36288 


ESTs 


6.4 


? 


105127 


AA045648 


Hs.301957 


nudix (nucleoside diphosphate linked moi 


2.2 


other 


105141 


AA164687 


Hs.177576 


mannosyt (alpha*1.3-)-gtyooprotein beta- 


2.8 


other 


105158 


AW976357 


Hs.234545 


hypothetical protein NUF2R 


2 


other 


105169 


BE245294 


Hs.180789 


3164 protein 


1.7 


other 


105186 AA191512 


Hs.28005 


Homo sapiens cDNA FLJ11309 fis, done PL 


4.9 


SSJM 


105254 


AA071276 


Hs.19469 


KIAA0859 protein 


2 


TM 


105281 


AA263143 


Hs^4596 


RAD51 -interacting protein 


2,9 


? 


105288 N99673 


Hs.3585 


ESTs. Weakly sinr^r to AF126743 1 DNAJ 


1.9 


TM 


105302 


AA700122 


Hs.3355 


sentrin-spedfic protease 


8.2 


? 


105331 


AW270037 


Hs,179507 


KIAA0779 protein 


1.8 


SS. 


105359 NM-016015 


Hs.8054 


CGI-68 protein 


8.4 


other 


105365 


BE264645 


Hs.282093 


hypothetical protein FLJ21918 


5.1 


other 


105373 


AW887701 


Hs.32356 


hypothetical protein FLJ20628 


2.6 


other 


105374 


BE242803 


Hs.262823 


hypothetical protein FU 10326 


2.2 


TM 


105387 


AW592146 


Hs.108636 


membrane protein CHI 


2.3 


SS.TM 


105393 


AF167570 


Hs.256583 


interieukin enhancer binding factor 3, 9 


5.5 


SS, 


105399 


BE3B5877 


Hs.334811 


Npw38-binding protein NpwBP 


1.6 


other 


105400 


AF1 98520 


Hs.65648 


RNA binding motif protein 8A 


1.6 


other 


105445 


AA252395 




gb:zs12g10.s1 NCI_CGAP_GCB1 Homo sapiens 


5.1 




105507 


BE268348 


Hs.226318 


CCR4-N0T transcription complex, subunit 


1.6 


other 


105529 


AA1 13449 


Hs,32471 


hypothetical protein FU20364 


1.3 


other 


105530 


AB023179 


Hs.9059 


KIAA0962 protein 


3.5 


other 


-105547 


AA262640 


Hs,27445 


unknown 


9.3 


other 


105564 


BE516694 


Hs.288042 


hypothetical protein FLJ 14299 


1.4 


other 


.105596 


AA579535 


Hs.18490 


hypothetical protein FLJ20452 


10.9 


TM 


105597 


AF054284 


Hs.334826 


spiidng factor 3b, subunit 1, 155kD 


2.9 


TM 


105608 


AI808201 


Hs.287863 


hypothetical protein FLJ 12475 


1.7 


? 


105610 


AA280072 


Hs.99872 


fetal Alzheimer antigen 


1.4 


other 


105617 


AK000892 


Hs.4069 


glucooofticold modulatory element bindin 


1.7 


TM 


105620 


AW302245 


Hs.181390 


casein kinase 1 . ganvna 2 


5.6 


other 


105658 


AA985190 


Hs^46875 


hypothetical protein FLJ20059 


9.4 


other 


105697 


AW499988 


Hs.27801 


3snc finger protein 278 


2 


TM 


105708 


R26944 


Hs.180777 


Homo sapiens mRNA: cONA OKFZp564M0264 (f 


1.7 


other 


105743 


BE246502 


Hs.95g8 


sema domain. Immunoglobulin domain Og)i 


2.7 


other 


105746 


AW151952 


Hs.46679 


hypothetical protein FU20739 


1.5 


? 


105759 


AI123118 


Hs.15159 


chemokine-like factor, altemativety spl 


1.3 


other 


105771 


AI267720 


Hs.153221 


synovial sarcoma, translocated to X chro 


1.6 


other 


105820 


AA741336 


Hs,152108 


transcriptional unit N 143 


2.2 


other 


105826 


AA478756 


Hs.194477 


E3 ubiquitin ligase SMURF2 


1.3 


other 


105856 


AI262105 


Hs.12653 


ESTs 


2.4 


other 


105858 


API 51066 


Hs,281428 


hypothetical protein 


2.9 


other 


105875 


AK001708 


Hs.32271 


hypothetical protein FLJ 10846 


1.4 


other 


105930 


AF016371 


Hs.9880 


peptidyl prolyl isomerase H (cydophilin 


5.3 


other 


106000 AW1 94426 


Hs,20726 


ESTs 


1.7 


other 


106011 


AW081202 


Hs.12284 


Homo sapiens, done IMAGE:2989556. mRNA, 


2.8 


other 


106017 


AA477956 


Hs.26268 


ESTs 


1.4 


other 


106073 


AL1 57441 


Hs.17834 


downstream neighbor of SON 


1.4 


other 


106078 


AA130158 


Hs,19977 


ESTs. Moderately similar to ALU8_HUMAN A 


1.6 


? 


106094 


AA533491 


Hs.23317 


hypothetical protein FLJ14681 


6.9 


other 


106140 


AB006624 


Hs.14912 


KIAA0286 protein 


1.6 


other 


106271 


AA251393 


Hs.289052 


Homo sapiens, Similar to RIKEN cDNA 5430 


10.8 


? 
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lUOZOO /\o\}of fH£. 


nS.Z*iO«30 


1^1 A A1 191 nmtoin 


1.3 


other 


lUOjOU Y1UU40 




nyTHnuouity 9*uup (nonnisione cnionuxnj 




ofhor 






CO 1 S( weoKiy siniiiar jo mohoh? MAiagai 




ss 


106350 AKu014u4 


ns.194D3B 


CjfCm o£. 


ti ft 


other 


1 06359 AW39u2o2 . 


ns^l i3u 


transmsmbrane 7 supeffamily member 2 


0.1 


other 


106381 AB040916 


nS.24i0o 


l\IAA14oO proiBin 


R R 
O.Q 


other 


106369 AW74o420 


nS.o23D 


nomo sapiens ojna: rLj/iHof its, crane v,* 


0 0 


1 In 


106457 AFl 19256 


Hs.27801 


zinc finger protein 278 




o&ier 


106470 D63078 


Hs.186180 


Homo sapiens cuna: rUZoUoo tis, clone u 




other 


106586 AA243837 


Hs.57787 


ESTs 


1.0 


other 


106589 AK000933 


Hs^BodI 


Homo sapiens cuna rLJiuUf i lis, cione nb 




o 


106610 AA458882 


Hs.79732 


fibulin 1 


o 
o 


ec 


106624 NM_003595 


ns.26350 


tyrosylprofein sulfotransferase 2 


7 fi 

f.Q 


other 


.106650 AL049951 


I.J_ AAOT/\ 

Hs.22370 


Homo sapiens mRNA, cuna ui\p^pdd4UU1z2 (i 


■i Q 
1.0 


other 


106669 AV657117 


t_J_ A n A A A 

Hs.184164 


EoTs, ivloderateiy sunilar to bobo5/ aipna 


1.0 


1 tVl 


106713 BE614802 


Hs. 184352 


hypothetical protein FU12549 


4.6 


other 


106717 AA600357 


Hs.239489 


TIA1 cytotoxic granule-associated RNA-bi 


1.0 


ouier 


106723 BE388094 


Hs.21857 


ESTs 


1.0 


oo. 


106795 AF174487 


ns.293753 


Bcl-2-related ovarian killer prqtein-lik 




oiner 


105829 AW959893 


Hs.27099 


hypotnelicai protem FLJ23293 similar to 


lo.2 


1 Ivi 


106831 BE564871 


Hs.29463 


centnn, cF-nand protein, o (C/UCol yeast 


1.0 


other 


106846 AB037744 


Hs.34892 


KIAA1323 protein 


2.2 


otner . 


106852 AF1 51031 


Hs.300631 


hypothetical protein 


1.3 


oiner 


106873 N49809 


Hs.11197 


1 1 • ^t__» lAAA ^P.04 4<14 ■ ■« 

Homo sapiens, clone IMAGE:3343149, mRNA, 


16.8 


other 


106886 W79171 


Hs.9567 


GL0U2 protein 


1.5 


IM 


106906 AA861271 


Hs.222024 


transcription factor 6MAL2 


2.2 


ottier 


106920 AK001838 


Hs.296323 


senjm/glucocorticoid regulated kinase 


^ A 

0.4 


other 


106945 AK000511 


Hs.6294 


hypothetical protein DKFZp434L1435 simil 


6.8 


o 
f 


106973 BE 156256 


Hs. 11923 


hypothetical protein 


0./ 


ottier 


106978 AVv631480 


nS.86oo 


ESTs 


0.1 


oo, 


107004 AA1 46872 


Hs.300700 


hypotnetica) protein FLJ20727 


1.3 


other 


107029 AF264750 


Hs.288971 


myetoid/lymphofd or mixed-lineage leukem 


'4 Q 
I.O 


other 


107071 AW385224 


. Hs.35198 


ectonucleotidepyrophosphatase/phosphodi 


1.7 


Otner 


A^'^A ^ A * A I^AAA<VAA 

107113 AK000733 


Hs^3900 


GTPase activating protein 




other 


107125 AK0Q0512 


Hs.69388 


nypotheticai protein FLJZOouo 


4 7 
1./ 


other 


107136. AV661958 


Hs.8207 


GK001 protein 


4.7 


Otner 


4lA*v^ A l/AA^t 

107146 AK001455 


Hs.5198 


Down syndrome critk:al region gene 2 




other 


107151 AW378065 


Hs.8687 


ESTs 


Ct. A 
D.4 




107155 AW391927 


Hs.7946 


t>t A A^AAA 1. _ • _ 

KIAA1 288 protein 


33.5 


Other 


107174 BE122762 


Hs.25338 


ESTs 


5.2 




107197 W 15477 


Hs.64639 


glioma pathogenesis-related protein 


0.1 


other 


107221 AW888411 


Hs.81915 


leukemia -associated phosphoprotein p18 ( 




other 


107243 BE219716 


Hs.34727 


ESTs, Moderately similar to l3o7o9 zinc 


7 A 

7.4 


o 


107248 AW263124 


t 1^ ^ A ^ A A A 

Hs.315111 


nuclear receptor co-repressor/HDAC3 comp 


A D 

1.0 


other 


107263 D60341 


Hs.21198 


translocase of outer mitochondiial membr 


6.7 


other 


107265 BE379594 


Hs.49136 


ESTs, Moderately similar to ALU7_HUMAN A 


2.5 


other 


107299 BE277457 


Hs.30661 


nypotneticai protein MGC4o0o 


J.J 


1 iVl 


107316 T63174 


Hs. 193700 


Homo sapiens mRNA; cDNA DKFZpooDlU324 (t 




livl 


107354 Nh/L005299 


1 l_ A/^ A An 

Hs.96448 


zinc finger protein 193 


e 
O 


o 
f 


107392 AW299900 


Hs.267632 


TATA element modulatory factor 1 


1.2 


other 


107481 AA307703 


Hs.279766 


kinesin family member 4A 


1.6 


other 


107529 BE515065 


Hs.296585 


nucleolar protein (KKE/u repeat) 


3 


TM 


lU/oM AmUUIood 


Uc hqhaa 
ns.oyo**** 


ESTs 




other 


107681 BE379594 


Hs.49135 


ESTs, Moderately similar to ALU7_HUMAN A 


2.3 


SS.TM 


107772 AA018587 


Hs.303055 


ESTs, Weakly similar to ALUI^HUMAN ALU S 


2.2 


? 


107859 mizim 


Hs.47584 


potassium voltage-gated channel, delayed 


8.4 


TM 


• 107901 L42612 


Hs.335952 


keratin 6B 


2.5 


ottier 


107922 BE153855 


Hs.61460 


Ig superfamily receptor LNfR 


23 


ottter 


107974 AW956103 


Hs.61712 


pynjvate dehydrogenase kinase, isoenzyme 


6.8 


ottier 


108040 AL121031 


Hs.159971 


SWI/SNF related, matrix associated, acti 


1.6 


ottier 


108230 AA054224 


Hs.59647 


ESTs 


1.3 


other 


108274 AF129535 


Hs,272027 


F-box only protein 5 


7.2 


? 


108296 N31256 


Hs.161623 


ESTs 


2.6 


ottier 


108496 AA083069 


Hs.339659 


ESTs 


3.6 


ottier 


108607 BE300380 


Hs,69476 


Homo sapiens cDNA FU12758 lis. done NT 


3.5 


ottier 


108621 AA101809 


Hs.182685 


ESTs 


1.7 


ottier 


108634 AW022410 


Hs.69507 


ESTs 


1.8 


SS.TM 
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108647 


BE546947 


Hs.44276 


homeo box CIO 


9.8 


Other 


108695 


AB029000 


HS70823 


K1AA1077 protein 


7.3 


other 


108740 


AI089575 


Hs.9071 


progesterone membrane binding protein 


2.8 


? 


108828 


AK001693 


Hs.273344 


DKFZP564O0463 protein 


1.9 


other 


108859 


AL121500 


Hs. 178904 


ESTs 


1.6 


TM 


108872 


H06720 


Hs.111680 


endosulfine alpha 


Z2 


other 


108891 


A1801235 


Hs.48480 


ESTs 


5.4 


other 


108894 


AK001431 


Hs.5105 


hypothetical protein FU 1 0559 


4.1 


TM 


108955 


AA149754 


Hs.195155 


Homo sapiens amino add transport system 


5.7 


? 


108982 


AA151708 


Hs.171980 


homeo box (expressed in ES cells) 1 


1.7 


other 


108987 


AA1 52178 


Hs.23467 


hypothetical protein FU10633 


6.3 


other 


109002 


AB028987 


HsJ2134 


KiAA1064 protein 


1.7 


other 


109011 


AA1 56542 


Hs72127 


ESTs 


1.5 


other 


109026 


AA1 57811 




gb:zo35d07.s1 Stratagene colon (937204) 


5.4 


other 


109056 


AA164293 


Hs.72545 


ESTs 


3 


other 


109101 


AW608930 


Hs.52184 


hypothetical protein FLJ20618 


1.6 


SS. 


109112 


AW419196 


Hs.257924 


hypothetical protein FU 13782 


3.3 


TM 


109124 


AK000684 


Hs.183887 


hypotiietica! protein FLJ22104 


1.7 


other 


109139 


AJ132592 


Hs.59757 


zinc finger protein 281 


2.7 


other 


109166 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (r^kines 


3 


TM 


109198 


BE566742 


Hs.58169 


highly expressed in cancer, rich n\ leuc 


2.1 


other 


109213 


NM 016603 


Hs.82035 


potential nuclear protein C50RF5; GAP>fl 


5.4 


other 


109220 


AW9S8181 


Hs.189998 


ESTs 


5.8 


other 


109233 


AU077281 


Hs.170285 


nucieoporin 214kD (CAIN) 


5.3 


other 


109270 


N99673 


Hs.3585 


ESTs Weaklv similar to AF126743 1 DNAJ 


1.4 


other 


109273 


AA375752 


Hs.82719 


Homo sapiens mRNA; cDNA DKFZp585F1822 (f 


3 


other 


109313 


API 53201 


HS.B6276 


C2H2 (Knippel-type) zinc finger protein 


1.3 


ottier 


109341 


AA213506 


Hs. 115099 


EST 


3 


7 


109391 


AL096858 


Hs.184245 


KIAA0929 protein Msx2 interacfing nudea 


1.5 


other 


109420 


H83603 


Hs.40408 


homeo box C9 


2.2 


SS, 


109426 


N30531 


Hs.42215 


protein phosphatase 1, regulatory subunl 


3.1 


TM 


109429 


AI160029 


Hs.61438 


ESTs 


2 


? 


109445 


AA232103 


Hs. 18991 5 


ESTs 


1.8 


other 


109450 


AB032969 


Hs.173042 


KIAA1143 protein 


3.8 


other 


109468 


NM-015310 


Hs.6763 


KIAA0942 protein 


3.3 


other 


109478 


AW074143 


Hs.87134 


ESTs 


2 


TM 


109570 


L40027 


Hs.1 18890 


glycogen synthase kinase 3 alpha 


2.1 


other 


109662 


F02614 


Hs.27319 


ESTs 


1.4 


other 


109825 


R71264 


Hs.16798 


ESTs 


1.3 


other 


110039 


HI 1938 


Hs.21907 


histone acetyltransferase 


2 


other 


110056 


AA503041 


Hs^79009 


matrix Gia protein 


2.5 


other 


110085 


AA603840 


Hs.29956 


KtAA0460 protein 


1.7 


other 


110110 


T07353 


Hs.7948 


ESTs 


2.9 . 


other 


110129 


R51853 


Hs.226429 


ESTs. Weakly similar to ALULHUMAN ALU S 


1.7 


SS.. 


110154 


NM_014521 


Hs.17667 


SH3-domain binding protein 4 


4.3 


other 


110240 


AI668594 


Hs.176588 


ESTs,. Weakly similar to CP4YJHUMAN CYTOC 


4.3 


? 


110242 


N41744 


Hs.19978 


CGI-ib protein 


1.3 


other 


110259 


H28428 


Hs.32406 


ESTs, Weakly similar to 138022 hypotheti 


2.2 


other 


110312 


BE256986 


Hs.11896 


hypothetical protein FUJI 2089 


2.1 . 


other 


110501 


H55748 




gb:yq94a01 .s1 Soares fetal liver spleen 


6.1 


? 


110504 


H55915 


Hs;210859 


hypothetical protein FLJ11016 


ell 


TM 


110525 


H57330 


Hs.37430 


EST 


6;4 


other 


110568 


AK001160 


Hs,5999 


hypothetical protein FLJ10298 


1.3 


? 


110699 


T97586 


Hs.18090 


ESTs 


1.8 


other 


110705 


AB007902 


Hs.32168 


KIAA0442 protein 


1.6 


TM 


110742 


AW190338 


Hs.28029 


hypothetical protein MGC11256 


7.8 


other 


110761 


AL138077 


Hs.16157 


hypothetical protein FU12707 


2.5 


other 


110762 


BE044245 


Hs,30011 


hypothetical protein MGC2953 


9.3 


? 


110765 


AK000322 


Hs.18457 


hypothetical protein FLJ20315 


' 5.5 


SS. 


110769 


BE000831 


Hs.23837 


Homo sapiens cDNA FU1 1812 fis, done HE 


2.1 


TM 


110799 


AI08g660 


Hs.323401 


dpy-30-like protein 


1.5 


TM 


110805 


T25829 


Hs.24048 


FK506 binding protein precursor 


6.7 


TM 


110813 


AA767373 


Hs.35669 


ESTs, Moderately similar to ALUI^HUMAN A 


5.7 


other 


110820 


R33261 


Hs.6614 


ESTs, Weakly similar to A43932 mudn 2 p 


3.4 


other 


110840 


N31598 


Hs.12727 


hypothetical protein FU216l6 


1.7 


TM 


110844 


AI740792 


Hs.167531 


methyicrotonoyl-Coenzyme A carboxylase 2 


1.7 


other 


110854 


BE612992 


Hs.27931 


hypothetical protein RJ10607 similar to 


4.7 


other 
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110856 AA992380 gb:ol37g06.s1 SoaresJestis_NHT Homo sap 

110885 BE384447 Hs.16034 hypothetical protein MGC13186 

110897 AL1 17430 Hs.6880 DKFZP434D1 56 protein 

110915 BE092285 Hs.29724 hypothetical protein FU13187 

5 110918 H04360 Hs.24283 ESTs. Moderately similar to reduced expr 

110958 NM_005864 Hs.24587 signal transducfon protein (SH3 contain 

110963 AKD02180 Hs.11449 DKFZP5640123 protein 

110981 AK001980 Hs^4284 ADP-ntosyltransferase (NAD+; poly(ADP-r 

110984 AW613287 Hs.80120 UDP-N-acetyl-atpha-D-galactosamine:poiyp 

10 111125 N63823 Hs.269115 ESTs, IVtoderateiy Similar foZ195_HUIV!ANZ 

111132 AB037807 Hs.83293 hypothefical protein 

111164 N46180 Hs.122489 Homo sapiens cDNA FU 13289 fis. clone OV 

111172 R67419 Hs.21851 Homo sapiens cDNA FLJ1 2900 fis. done NT 

111174 AL050166 Hs.26295 Homo sapiens mRhlA; cDNADKF2p58BD1 122 (f 

15 111179 AK000136 Hs.10760 asporin (LRR dass 1) 

111184 AI815486 Hs.243901 Homo sapiens cDNA FU20738 fts, done HE 

111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reveree t 

111216 AW139408 Hs.152940 ESTs 

111221 AB037782 Hs.15119 KIAA1361 protein 

20 111223 AA852773 Hs.334838 KIAA1 866 protein 

1 1 1239 N90956 Hs.1 7230 hypothetical protein FU22087 

1 1 1 285 AA77871 1 Hs.431 0 eukaryotic translation initiation factor 

111299 AB033091 Hs.74313 KIAA1 265 protein 

111312 A1523913 Hs.34504 ESTs 

25 111318 T99755 Hs.334728 ESTs 

1 1 1 337 AA837395 Hs.263925 LIS1 -interacting protein NUDE1 , rat homo 

111352 H58589 Hs.35156 Homo sapiens cDN A FU 11 027 fis. done PL 

111370 A1478658 Hs.94631 brefeldin A-inhibited guanine nucleotide 

111384 N94605 Hs.288969 HSCARG protein 

30 111389 AK000987 Hs. 1691 11 oxidation resistance 1 

111452 R02354 Hs.15999 ESTs 

111486 'A1051194 Hs.227978 EST 

111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1.HUMANZ 

111585 R10720 Hs.20670 EST 

35 111627 R52656 Hs,21691 ESTs 

111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein. 

111937 BE298665 Hs.14d46 Homo sapiens mRNA; cDNA DKFZp564D016 (fr 

111944 AW083791 Hs.21263 suppressor of potassium transport defect 

111987 NM^015310 Hs.6763 KIAA0942 protein 

40 112134 R41823 Hs.7413 ESTs; calsyntenin-2 

112244 AB029000 Hs.70823 KIAA1 077 protein 

1 12388 R46071 Hs.301693 Homo sapiens, clone IMAGE:3638994. mRNA, 

1 12456 NM_016248 Hs.232076 A kinase (PRKA) anchor protein 1 1 

1 12464 AW007287 Hs.28538 Homo sapiens cDNA: FU21086 fis. done C 

45 112506 Ai742756 Hs.26079 ESTs 

112513 R68425 Hs.13809 hypothetical protein FU 10648 

112752 AK001635 Hs.14838 hypothetical protein FU 10773 

112884 AK000004 Hs.5013 Homo sapiens mRNA for FU00004 protein. 

112923 T10258 Hs.5037 EST 

50 112936 AW970826 Hs.6185 KIAA1 557 protein 
112958 R61388 Hs.6724 ESTs 

112966 Z44718 Hs. 102548 glucocorticoid receptor DN A binding fact 
112978 AK000272 Hs.7099 hypolhetical protein FU20265 
112995 AA737033 Hs7155 ESTs. Moderately similar to 21 15357ATYK 
55 112996 BE276112 Hs.7165 zinc finger protein 259 
113047 AI571940 Hs.7549 ESTs 

113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein. 
113089 T40707 Hs.270862 ESTs 

113196 T57317 gb:yb51a03^1 Stratagene fetal spleen (9 

60 113248 T63857 gb:yc16e01^1 Stratagene lung (937210) H 

113254 AK0a2180 Hs.11449 DKFZP56401 23 protein 

113277 AW971049 Hs.1 1774 protebi (peptidyl-prolyl ds/lrans isome 

113429 AA688021 Hs.179808 ESTs 

113499 AI487908 Hs.8882 ESTs 
65 113547 H59588 Hs.15233 ESTs 

1 13647 AA813887 Hs.188173 Homo sapiens cDNA FU12187 fis. done MA 

1 13702 T97307 gb:ye53h05^1 Scares fetal liver spleen 
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TM 
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other 
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other 
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other 
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other 
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other 
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other 
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other 
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TM 
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other 
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TM 
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other 
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other 
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32 


other 
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other 
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other 
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other 


2 


other 
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other 
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TM 
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other 
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other 
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113759 AW499665 


Hs.9456 


SWl/SNF related, matrix associated, acti 


1^ 


other 


113777 BE266947 


Hs.10590 


zinc finger protein 313 


13,4 


other 


113783 AL359588 


Hs.7041 


hypothetical protein DKFZp762B226 


17 


other 


113791 AI269096 


Hs.135578 


chitobtase, di-N-acetyt- 


1.3 


other 


113808 W44735 


Hs.9286 


Homo sapiens cDNA: FU21278 fis. done C 


3.3 


other 


113811 BE207480 


Hs.69g4 


Homo sapiens cDNA: FU22044 fis, done H 


3.1 


other 


113817 H13325 


Hs.332795 


hypothetical protein DKFZp76 1017 121 


3.2 


other 


113826 AW378212 


Hs.24809 


hypothetical protein FU10826 


2.3 


? 


113834 T26483 


Hs.6059 


EGF-containing fibulin-tike extracelluta 


11.3 


TM 


113868 W57902 


Hs.90744 


proteasome (prosome. macropain) 26S subu 


2.7 


other 


113870. AL079314 


Hs.16537 


hypothetical protein, similar to (U06944 


6.1 


other 


113885. AW959486 


HS51732 


ESTs 


6.6 


. other 


113923 •-AW953484 


Hs.3849 


hypothetical protein FLJ22041 sinrular to 


1.9 


? 


113989 W87544 


Hs.268828 


ESTs 


1.2 


other 


114022 AI539519 


Hs.120969 


Homo sapiens cDNA FU1 1562 fis, done HE 


5.4 


other 


114030 A1825386 


Hs.164478 


hypothetical protein FU21939 similar to 


9.4 


other 


114060 AB029551 


HsJ910 


RING1 and YY1 binding protein 


1.8 


other 


114195 AF017445 


Hs.1 50926 


fucose-1 -phosphate guanylyllransferase 


1.5 


other 


114226 AB028968 


Hs.7989 


KIAA1045 protein 


1.8 


other 


114253 BE149866 


Hs,14831 


Homo sapiens. Similar to zinc finger pro 


2.3 


other 


114262 All 17518 


Hs.3686 


K1AA0978 protein 


1.4 


TM 


114275 AW515443.comp 


Hs.305117 


KIAA0305 protein 15.8 


114292 AI815395 


Hs.184641 


fatty acid desaturase 2 


1.9 


TM 


114309 AA332453 


Hs.20824 


CGI-85 protein 


2.4 


other 


114392 AA249590 


Hs. 100748 


ESTs. Weakly similar to A28996 prol/ne-r 


1.9 


other 


114407 BE539976 


Hs.103305 


Homo sapiens mRNA; cDNA DKFZp434B0425 (f 


1.3 


TM 


114455 H37908 


Hs.271616 


ESTs. Weakly similar to ALU8.HUMAN ALU S 


5.6 


other 


114463 AL120247 


Hs.40109 


KIAA0872 protein 


5.3 


TM 


114464 AI091713 


Hs.106597 


Homo sapiens, Similar to RIKEN cDNA 1110 


1.3 


other 


114471 AA028074 


Hs.104613 


RP42homolog 


1.9 


? 


114480 BE066778 


Hs.151678 


UDP-N-acetyl-alpha-D-galactosamine:polyp 


13.4 


other 


114671 AA766268 


Hs.266273 


hypothetical protein FLJ13346 


2 


other 


114698 AA476966 


Hs.1 10857 


polymerase (RNA) 111 (DNA directed) poly 


3.6 


other 


114730 AI373544 


Hs.331328 


intermediate filament protein syncoilin 


3.9 


other 


114767. AI859865 


Hs.154443 


minichromosome maintenance defictent (S. 


1.7 


other 


114774 AV656017 


Hs.184325 


CGI-78 protein 


3.2 


other 


114798 AA159181 


Hs.54900 


serologically defined colon cancer antig 


3.6 


other 


114860 AL157545 


Hs.42179 


bromodonna/n and PHD finger containing. 3 


4.4 


other 


114895 AA236177 


Hs.76591 


K(AA0887 protein 


7.2 


other 


114896 BE539101 


Hs.5324 


hypothetical protein 


1.3 


other 


114911 AA236672 




gb:zt29f02.s1 Soares ovary tumor NbHOT H 


1.5 


other 


114930 AA237022 


Hs.188717 


ESTs 


2 


SS. 


114938 AA242834 


Hs.58384 


ESTs 


2.9 


other 


114965 A1733881 


Hs.72472 


BMP-R1B 


2.3 


? 


115023 AF102546 


Hs.63931 


dachshund (Drosophila) homolog 


1.3 


other 


115038 AA252360 


Hs.87968 


toll-like receptor 9 


1,6 


other 


115061 A1751438 


Hs.41271 


' Homo sapiens mRNA full length insert cDN 


1-1.8 


other 


115117 AI670847 


Hs.5324 


hypothetical protein 


1,5 


other 


115206 AW183695 


Hs,186572 


ESTs 


2.5 


other 


115221 AW365434 


Hs.79741 


hypothetical protein FLJ10116 


1.5 


other 


115239 BE251328 


Hs.73291 


hypothetical protein FLJ 10881 


1.3 


TM 


115242 AI368236 


Hs.283732 


ESTs, Moderately similar to AtU1_HUMAN A 


1.4 


other 


115278 AK002163 


Hs.301724 


hypothetical protein FU11301 


1.5 


other 


115285 AW972872 


Hs.293736 


ESTs 


2.4 


other 


115291 BE545072 


Hs.122579 


hypothetical protein FU 10461 


6.3 


SS. 


115400 At215069 


Hs.89113 


ESTs 


6.7 


? 


115468 AA314349 


Hs.48499 


tumor antigen StP-8p 


7.5 


? 


115471 AKD01376 


Hs.59346 


hypothetical protein FU10514 


1.4 


TM 


115479 AW301608 


Hs.278188 


ESTs. Moderately similar to 154374 gene 


4.1 


TM 


115496 AW247593 


Hs71819 


eukaryotic translation initiation factor 


16.3 


other 


115500 Y14443 


Hs.88219 


. zinc finger protein 200 


5 


other 


115553 AJ275986 


Hs.71414 


transcription factor (SMIF gene) 


2.5 


other 


115581 AI540842 


Hs.61082 


ESTs 


6.2 


other 


115587 BE081342 


Hs.283037 


HSPC039 protein 


2.9 


other 


115590 AA399477 


Hs.67896 


7-60 protein 


5.3 


TM 


115646 N36110 


Hs^05971 


solute carrier family 2 (facilitated giu 


4.3 


? 


115652 BE093589 


Hs.38178 


hypothetical protein FU23468 


10.6 


other 



other 



158 



wo 02/059377 PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



115655' AL048269 


Hs.288544 


Homo sapiens, done MGC:16063, mRNA, conn 


12.7 


' TM 


115663 A)138785 


Hs.40507 


ESTs 


2 


other 


115676 AA953006 


Hs.88143 


ESTs 


3.1 


other 


115690 AA625132 


Hs 44159 


hypothetical protein FLJ21615 


1.7 


TM 


115693 AF231023 


Hs.55173 


cadherin, EGF LAG seven-pass G-fype fece 


6.9 


other 


115715 BE395161 

1 BWf Iw WWww%^ l\# > 


Hs.1390 


proteasome (prosome, maoopain) sutiunit 


1.7 


other 


115734 AI950'?39 


Hs.40782 


ESTs 


2.7 


TM 


115811 MM 01*1434 


Hs 48604 


DKFZP434B168 protein 


2.1 


other 


115823 AI732742 


Hs.87440 


ESTs 


2.1 


other 


115837 A1675217 


Hs.42761 


ESTs 


1.3 


other 




Hs 332938 


hvDofhetical orotein MGC5370 


4.4 


other 


1 1 '>RKR A WnR9R2Q 


Hs 52081 


KIAA0867 Drotein 

l\l^^i^wwW# |^l\^iwlll 


7.3 


other 


115875 N556S9 


Hs 333823 


mitochondrial ribosonrial protein LI 3 


1.2 


other 


115Q41 Aia67451 


Hs 46679 


hypothetical protein FLJ20739 


5.5 


other 


115968 AB037753 


Hs 62767 

f lw« w^ • w' 


K)AA1332 protein 


9.8 


other 


1 1finn'^ RF27*%4fiQ 


Hs 66493 


Dnwn svndrome critical rsoion QenB 5 


1.4 


other 




Hs 57664 


Homo sanlens mRN A fiill lenath insert cDN 


2.4 


other 


llfiinS AA77nfifl8 


Hs 28777 


H2A histone famiJv member L 


1.8 


other 


11R1'^4 RF24'^R'W 


Hs 50441 


nGt-04 Drotein 


1.4 


other 


11R1RQ M'^R71Q 


Hs 4474Q 


F5^Ts Mndpraf pIv similar to TQ0358 hvDot 


1.2 


other 


1 iQ tVD r\V\0£. 1 1 lO 


Hs 72402 


ESTs 


2.1 


other 


11R9'^ft A\/fiRn717 


Hs.47144 


r)KFZPS86N0819 orotein 


1.7 


other 


11R94R AF9R«\*i^*i 


Hs 2*10646 


hsiri ilftviml lAP rpnpaf-nnntainina 8 


1.7 


other 


11K0R9 AIQ'^RA^Q 


Hs S983R 


hvnnthetrrfil nrafein FLJ 10808 

1 IJpUU Iwli^wl |^lwK7HI I bW lUWW 


1.8' 


? 


UROQft AIQR<;^11 
1 104^90- 1 1 


Hs Q4inQ 


Hnmn sanipns ^^NA FLJ13634 fis done PL 


1.9 


other 






Hplpfpfl in fencer 1* RNA hpticasp HI^/DI 


5 


SS, 


1 1 RU^ A M791 OR 




Hnmo saoiens cDNA FLJ11663 fis done HE 


1.4 


ss! 




He dflfld 


k^1AAin9<^ nmfpin 


1.9 


? 




He AACST^ 


rIinpnfiH\fl npntlrinso R 


1.5 


• other 


IIR^CO AA^Q719Q 


Hs 1Ad771 

MO* IO*rr f 1 


nif/^ar fstrfnr UC fClClAAT^tandina iransc 

IIUvlwCII IflwlUI IfW ^VVnm^VIIIUUHJ UOIrav 


1.9 


2 


1 10030 Ml I*t930u 


He ')R19S 


SfitprfpmnJnHiif^H nmtPin 75 S2kD 

lilUSIiOiull^llUU^M?U ylwiClii ' wi w&i\iy 


1.9 


? 


1 1A?R(; M(\ni7A 


He 4R7RR 




6.1 


other 


11R?RR MpHdRR 


Hs 71109 


KIAA1229 orotetn 


1.6 


? 


IIRAI? A\A/4QQRR4 


Hs 12484 


Human Hnnp 2*^828 mRN A secnience 


7,4 


ottier 


IIRA'^fi AA1R1A11 
1 ION 00 /vMOI*tII 


Hs *%8RR8 


r*hrnmr»snmp 21 nnpn rparihia frame 57 


2;i 


other 


11fi4fi9 AF9lfl'^1'^ 


Hs 236828 


nutative hellcase RUVBL 
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Hs.169896 
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ESTs 
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15 124857 R63652 Hs.137190 
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124863 AI382555 Hs.127950 
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BE147740 
AF292100 
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AI470163 
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HS.10S097 thymidine kinase 1, soluble 
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Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (f 
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Hs.1 07318 kynurenine 3-monooxygenase (Icynurenine 3 
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Hs. 1 08 1 04 ubiquitin-conjugating enzyme E2L 3 

Hs.326234 ESTs. Highly similar to T46422 hypotheti 

Hs. 102267 lysosomal 
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Hs.288908 WW Domain-Containing Gene 

Hs. 108642 anc finger protein 22 (KOX 15) 
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Hs.250723 hypothelica! protein MGC2747 
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Hs.241 576 hypothetical protein PR02577 
Hs.183299 ESTs 
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Hs.1 1441 chromosome 1 open reading framB 8 

Hs.278428 progestin induced protein 
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133174 


AA431620 


Hs.324178 


hypothetical protein MGC2745 


2.7 


other 


133175 


AW955632 


Hs.65656 


ESTs» Weakly simiiar to SI 9560 profine-r 


9.3 


other 


133177 


X97795 


Hs.66718 


RAD54 (S.cerevisiae)-like 


4.5 


TM 


133208 


A1801777 


Hs.6774 


ESTs 


5,5 


TM 


133226 


AW954559 


Hs.295287 


Homo sapiens. Similar to bromodomain-con 


2.7 


other 


133228 


AI492924 


Hs.6831 


golgi phosphoprotein 1 


1.7 


? 


133254 


AI567421 


Hs273330 


Homo sapiens, clone IMAGE:3544662, mRNA. 


1.3 


other 


133268 


AW956781 


Hs.293937 


ESTs. Weakty similar to FXD2^HUMAN FORKH 


12.2 


other 


133291 


BE297855 


Hs.69855 


NRAS-related gene 


1.2 


other 


133314 


AA102670 


Hs.70725 


gamma-aminobutyric acid (GABA) A recepto 


17 


TM 


133321 


T79526 


Hs.179516 


integral type 1 protein 


11.1 


? 


133327 


AL390127 


Hs.7104 


KnjppeMike factor 13 


Z9 


other 


133347 


BE257758 


Hs.71475 


acid duster protein 33 


Z5 


? 


133360 AI016521. 


Hs.71816 


v-akt murine thymoma viral oncogene homo 


1.5 


other 


133386 AA292811 


Hs.72050 


non-metastatic cells 5. protein expresse 


2.1 


other 


133367 AF231919 


Hs.18759 


KtAA0539 gene product 


1.3 


other 


133370 AF245505 


Hs.72157 


OKFZP564I1922 protein 


2.2 


other 


133390 AI950382 


Hs.72660 


phosphatidylserine receptor 


5.7 


TM 


133391 


AW103364 


Hs.727 


inhibin« beta A (activin A, activin AB a 


25.5 


other 


133394 


AA305127 


Hs.237225 


hypothetical protein HT023 


3.3 


other 


133437 


AL031591 


Hs.7370 


phosphotidylinositol transfer protein, b 


.16 


other 


133452 


Nf^002759 


Hs.274382 


protein kinase, interferon-inducible dou 


4.1 


other 


133453 


AI659305 


Hs.73826 


protein tyrosine phosphatase, non-recept 


1.5 


other 


133500 


AW964804 


Hs74280 


hypothetical protein FLJ22237 


6.3 


TM 


133529 


W45623 


Hs.74571 


ADP-ribosylation factor 1 


4 


? 


133543 


AU077073 


Hs.108327 


damage-specific DNA binding protein 1 (1 


1.8 


? 


13357Q 


AU077050 


Hs.75066 


translin 


1.5 


other 


133579 


X75346 


Hs.75074 


mitogen-activated protein kinase-activat 


3.5 


TM 


13358? 


BE391579 


Hs.75087 


Fas-activated serine/threonine kinase 


6.8 


TM 


133594 


AW1 60781 


Hs.172589 


nuclear phosphoprotein similar to S. cer 


2.6 


TM 


13359*5 


AA393273 


Hs.75133 


transcription factor 6-like 1 (mitochond 


1.4 


other 


133599 


NIVL002885 


Hs.75151 


RAP1, GTPase activating protein 1 


8.1 


other 


133621 


NM^004893 


Hs.75258 


H2A histone family, member Y 


13.5 


other 


133627 


NM_002047 


Hs.75280 


glycyt-tRNA synthetase 


2.2 


other 


133631 


NM_000401 


Hs.75334 


exostoses (multiple) 2 


1.8 


other 


133649 


U25849 


Hs.75393 


add phosphatase 1, soluble 


2 


other 


133690 


AV661185 


Hs.75574 


mitochondrial ribosomal protein LI 9 


2.8 


other 


133720 


L27841 


Hs.75737 


periceiitriolar material 1 


6.8 


other 


133722 


AW969976 


HS279009 


matrix Gla protein 


2.5 


other 


133751 


AW402048.comp 


Hs.334787 


Homo sapiens, S 


133757 


T52946 


Hs.196209 


RAE1 (RNA export 1, S.pombe] homoiog 


1.4 


? 


133760 


BE271766 


Hs.181357 


lamfnin receptor 1 (67kD, ribosomal prot 


5.4 


other 


133765 


M62194 


Hs.75929 


cadherin 11, type 2. OB-cadherin (osteob 


5 


other 


133780 


AA557660 


Hs.76152 


decorin 


3.8 


other 


133797 


AL133921 


Hs.76272 


retinoblastoma-binding protein 2 


3.1 


? 


133822 


D50525 


Hs.699 


peptidylprolyl isomerase B (cydophilin 


9.7 


? 


133842 


AW797468 


Hs.285013 


putative human HLA class 11 assodated p 


2.4 


other 


133845 


AA147026 


HSJ6704 


ESTs 


2.5 


other 


133865 


AB011155 


Hs.170290 


discs, targe (Drosophila) homoiog 5 


5 


other 


133867 


AW340125 


Hs.76989 


KIAA0097 gene product 


2.5 


? 


133868 


AB012193 


Hs.183874 


cullin 4A 


2.1 


other 


133922 


U30825 


Hs.77608 


splidng factor, anginine/serine-rich 9 


2.8 


TM 


133924 


D86326 


Hs.325948 


' veside docking protein pi 15 


1.8 


SS, 


133929 


NM„006306 


Hs.211602 


SMC1 (structural maintenance of chromoso 


2 


? 


133936 


L17128 


Hs.77719 


gamma-ghjtamyl carboxylase 


2.6 


other 


133941 


BE244332 


Hs.77770 


adaptor-related protein complex 3, mu 2 


2.9 


other 


133959 


X81789 


Hs.77897 


splidng factor 3a, subunit 3, 60kD 


10.4 


other 


133976 


AI908165 


Hs.169946 


GATA43inding protein 3 (T-cell receptor 


1.9 


other 


133989 


AL040328 


Hs.78202 


SWI/SNF related, matrix associated, acti 


2.6 


SS. 


133997 


AI824113 


Hs.78281 


regulator of G-protein signalling 12 


13 


other 


134010 


AB016092 


Hs.197114 


RfsfA binding protein; AT-ndi element bin 


8.8 


other 


134015 


D31764 


Hs.278569 


sorting nexin 17 


1.5 


SS. 


134070 


NM«C03590 


Hs.78946 


cuirm 3 


6.3 


other 


134110 


U41060 


Hs.79136 


UV-1 protein, estrogen regulated 


2.7 


other 


134129 


NM.014742 


Hs.79305 


K1AA0255 gene product 


4.2 


other 


134134 


H86504 


Hs.173328 


protein phosphatase 2. regulatory subuni 


1.7 


other 


134200 


BE559S98 


Hs.197803 


K1AA0160 protein 


2.6 


other 
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134206 AF107463 
134219 NM_000402 
134234 BE300078 
134275 A1878910 
5 134292 AI906291 
134301 AW502505 
134305 U61397 
134324 AB029D23 
134326. AW903838 
10 134329 N92036 

134337 NM_004922 
134348 AW291946 
134367 AA339449 
134376 X06560 
15 134379 AW362124 
134384 A1589941 
134391 AA417383 
134395 AA456539 
134405 AW067903 
20 134411 BE272095 
134415 AI750762 
134421 AU077196 
134424 Z44190 
134446 AA1 12036 
25 134447 M58603 
134470 X54942 
134480 NM^OOSOOO 
134485 X82153 
134498 AW246273 
30 134513 AA425473 
134516 AK001571 
134520 BE091005 
134529 AW411479 
134577 BE244323 
35 134582 AA927177 
134612 AWa68223 
134624 AF035119 
134632 X78520 
134654 AKD01741 
40 134866 BE391929 
134687 U62317 
134692 NM.003474 
134705 BE161887 
134714 Y14768 
45 134719 AA852985 
134722 AF129536 
134746 X07871 
13475t AW630B03 
134790 BE002798 
50 134834 AW451370 
134850 A1701162 
134853 BE268326 
134880 AI879195 
134925 AW885909 
55 134955 AW401361 
134971 AI097346 
134975 R50333 
135011 AB037835 
135022 NM_000408 
60 135032 AW301984 
135077 AW503733 
135083 AB036063 
135095 AF027219 
135098 AA081258 
65 135153 AI093155 
135181 BE250865 
135199 AA477514 



Hs.79968 splicing factor 30, survival of motor ne 

Hs.80205 glucose-6-phosphate dehydrogenase 

Hs.80449 Homo sapiens, done IMAGE:3535294, mRNA. 

Hs.3688 cisplatin resistance-associated cverexpr 

Hs.81234 immunoglobulin superfamily, member 3 

Hs.81360 Homo sapiens cDNA: FU21927 fis, done H 

Hs.81424 ubiquitin-fike 1 (sentrin) 

Hs.179946 KIAA1100 protein . . 

Hs.81800 . chondroitin sulfate proteog^n 2 (vers 

Hs.81848 RA021 (S. pombe) homolog 

Hs.81964 SEC24 (S. cerevisiae) related gene famil 

Hs.82065 interleukin 6 signal transducer (gpl 30. 

Hs.82285 phosphoribosylglydnamide fomiyttransfer 

Hs.82396 2',5'-ongoadenylate synthetase 1 (40-46 

Hs.323193 hypothetical protein MGC3222 

Hs.8254 Homo sapiens, Similar to tumor different- 

Hs.82582 integrin, beta-like 1 (with EGF-Bke rep 

Hs.8252 lysosomal 

Hs,82772 collagen, type XI, alpha 1 

Hs.167791 reticulocalbin 1, EF-hand calcium bindin 

Hs.8291 1 protein tyrosine phosphatase type IVA. m 

Hs.82985 collagen, type V, alpha 2 

Hs.83023 peroxisomal biogenesis factor 1 1 B 

Hs.83419 KIAA0252 protein 

Hs-83428 nudear factor of kappa light polypeptid 

Hs.83758 CDC28 protein kinase 2 

Hs.83916 Empirically selected from AFFX single pr 

Hs.83942 cathepsin K (pycnodysostosis) 

Hs,841 31 threony l-tRN A synthetase 

Hs.84429 K1AA0971 protein 

Hs.273357 hypothetical protein FU1 0709 

Hs.74861 activated RNA polymerase il transcriptio 

Hs.848 FK506-binding protein 4 (59kD) 

H5.85951 exportin, tRNA (nudear export receptor 

Hs.86041 CGG triplet repeat binding protein 1 

Hs.171581 ubiquitin C-terminal hydrotese UCH37 

Hs.8700 deleted in liver cancer 1 

H5.17413g chloride channel 3 

Hs.d739 hypothetical protein FU10879 

Hs.8752 transmembrane protein 4 

Hs.88251 arylsulfatase A 

Hs.8850 a disintegrin and metalioproteinase doma 

Hs.88799 anaphase-promoting complex subunit 10 

Hs.890 lysosomal 

Hs.89232 chromobox homolog 5 (Drosbphila HP1 aiph 

Hs.284226 F-box only protein 6 

Hs.89476 CD2 antigen (p50), sheep red blood cell 

Hs.89497 laminBI 

Hs.287850 integral membrane protein 1 

Hs.8991 adaptor-related protein complex 1 , gamma 

Hs,90207 hypothetical protein MGC1 1138 

Hs.90280 5-aminoimidazo!e-4-carboxamide ribonude 

Hs.90806 1 5 kDa selenoprotein 

Hs.6975 PRO1073 protein 

Hs.91773 protein phosphatase 2 (formeriy 2A), cat . 

Hs.286049 phosphoserine aminotransferase • 

Hs-92186 Leman coiled-coil protein 

Hs.92991 KIAA1414 protein 

Hs.93201 glycerot-3-phosphate dehydrogenase 2 (mi 

Hs.173685 hypothetical protein FU12619 

Hs.9414 K1AA1488 protein 

Hs.94262 p53-indudbleribonudeo6de reductases 

Hs.9443 zinc finger protein 202 

Hs.132390 zinc linger protein 36 (KOX 18) 

Hs.95420 JM27 protein 

Hs.279529 px19-iike protein 

Hs.86247 translin-assodatedfadorX 
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other 
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other 
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other 


2 


other 
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other 
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other 
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other 
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other 
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other 
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other 
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other 
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other 
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135207 


N26427 


Hs.9634 


ESTs. Highly similar to C10_HUMAN PUTATI 


17 


other 


135214 


T78802 


Hs.96560 


hypothetical protein FU 11 656 


6.2 


other 


135243 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


2.8 


TM 


135245 


AI028767 


H$.252603 


ESTs 


12.2 


TM 


135257 


AW291023 


Hs.97255 


ESTs. Weakly similar to A46010 X-linked 


7.7 


TM 


135263 


AI088775 


Hs.55498 


geranyfgeranyi diphosphate synthase 1 


1.8 


other 


135274 


AA448460 


Hs.112017 


GE36 gene 


4.2. 


SS. 


135294 


AA150320 


Hs.9800 


protein kinase NJmu-RI 


1.2 


other 


135295 


AI090838 


Hs.98006 


ESTs 


4.9 


other 


135307 


AI743770 


Hs.98368 


ESTs, Weakly similar to KIAA0822 protein 


5.9 


? 


135321 


AI652069 


Hs.98614 


ribosome binding protein 1 (dog 180kD ho 


12.3 


TM 


135354 


AA456454 


Hs.183418 


cell division cycle 2-like 1 (PITSLRE pr 


5.8 


? 


135361 


AA373452 


Hs.1 67700 


Homo sapiens cDNA FU10174 fis, clone HE 


8.1 


other 


135389 


U05237 


Hs.99872 


fetal Alzheimer antigen 


1.9 


other 


135400 


X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


13.9 


TM 


302256 


AA857131 


Hs.171595 


HIV TAT specific factor 1 


1.6 


other 


302276 


AW057736 


Hs;323910 


HER2 receptor tyrosine kinase (C'ert)-b2, 


5.3 


otiier 


303135 


AW592789 


. Hs.279474 


HSPC070 protein 


2.2 


TM 


303686 


AK000714 


Hs.109441 


MSTP033 protem 


1.4 


SS, 


310085 


R43191 


Hs.101248 


Homo sapiens ck>ne IMAGE:32553, mRNA seq 


5.2 


other 


315518 


AA608229 


Hs.167771 


ESTs 


2.3 


? 


317781 


NM_007057 


H5.42650 


ZW10 interactor 


2.9 


? 


320836 


AI268997 


Hs.197289 


fat)3 GTPase-activatlng protein, non-cata 


2 


other 


321114 


AA902256 


Hs.78979 


Golgi apparatus protein 1 


5.6 


SS, 


322221 


N24236 


Hs.179662 


nucleosome assembly protein 1-llke 1 


1.4 


? 


322474 


AF1 18083 


Hs.29494 


PR01 91 2 protein 


1.3 


other 


322556 


BE041451 


Hs.177507 


hypothetical protein 


2.9 


SS, 


323541 


AF292100 


Hs.1 04613 


RP42 homolog 


1.6 


other 


407827 


.BE278431 


Hs.40323 


BUB3 (budding uninhibited by benzimidazo 


1.8 


other 


408196 


AL034548 


Hs.43627 


SRY (sex detennining region Y)-box 22 


1.6 


other 


408813 


AI580090 


Hs.48295 


RNA helicase family 


6.2 


other 


409176 


R73727 


Hs.101617 


ESTs, Weakly similar to T32527 hypotheti 


5.7 


other 


409487 


H19886 




gb:yn57a05.r1 Soares adult brain N2b5HB5 


2.7 


other 


413670 


AB000115 


Hs.75470 


hypothetical protein, expressed in osteo 


2.6 


? 


414108 


AI267592 


Hs.75761 


. SFRS protein kinase 1 


2.4 


TM 


414846 


AW304454 


HSJ7495 


UBX domain-containing 1 


2.4 


other 


416040 


AW819158 


Hs.289044 


Homo sapiens cDNA FLJ12048 fis, clone 


2.3 


other 


416980 


AA381133 


Hs.80684 


high-mobility group (nonhistone chromoso 


4.2 


TM 


417378 


R57256 


Hs.82037 


TATA box binding protein (TBP)-associate 


23.6 


other 


418283 


S79895 


Hs.83942 


cathepsin K (pycnodysostosis) 


5.8 


other 


418467 


NM.006910 


Hs.85273 


retinoblastoma-bifKling protein 6 


1.3 


other 


420269 


U72937 


Hs.95264 


alpha thalassemia/mentel retardation syn 


1.6 


? 


420802 


U22376 


Hs.1334 


v-myb avian myeloblastosis viral oncogen 


Z3 


? 


421225 


AA463798 


H5.102696 


MCT-1 protein 


1.6 


? 


421642 


AF172066 


Hs.106346 


■ retinoic acid lepressible protein 


3,5. 


other 


421828 


AW891985 


Hs.279789 


histone deaoetylase 3 


5 


other 


421983 


Ai252640 


Hs.1 10364 


peptidylproiyi isomerase C (cydophtlin 


3.1 


TM 


422052 


AA302744 


Hs.104518 


ESTs 


1.9 


TM 


422055 


NM_014320 


Hs.1 11029 


putative heme-binding protein 


2.4 


other 


423750 


AF165883 


Hs.298229 


prefoldin 2 


4.2 


? 


424001 


W67883 


Hs,137476 


paternally expressed 10 (PEG10; KIAA105 


7.1 


? 


425182 


AF041259 


Hs.155040 


zinc finger protein 217 


2.3 . 


other 


425284 


AF155568 


Hs,155489 


NS1 -associated protein 1 


3.5 


other 


426372 


BE304680 


Hs.169531 


DEAD/H (Asp-Glu-Ata-Asp/His) box poiypep 


1.9 


? 


428049 


AW183765 


Hs.182238 


GW128 protein 


7.6 


? 


428477 


AW500533 


Hs.1 1482 


splicing factor, arginine/serine-rich 1 1 


1.7 


other 


437562 


AB001636 


Hs.5683 


DEAD/H (Asp-Glu-Ala^sp/His) box poiypep 


2.4 


other 


438449 


AK001333 


Hs.6216 


Homo sapiens hepatocellular carcinoma^s 


3.8 


other 


441560 


F13386 


HsJ888 


Homo sapiens clone 23736 mRNA sequence 


5.6 


other 


445580 


AF167572 


Hs.12912 


skb1 (S. pombe) homolog 


2 


TM 


446999 


AA151520 


Hs.334822 


hypothetical protein MGC4485 


7.6 


other 


447111 


AI017574 


Hs.17409 


cysteine-rldi protein 1 (intestinal) 


2.2 


other 


447778 


BE620592 


Hs.71190 


ESTs, Weakly similar to SI 6505 hypotheti 


2.9 


other 


448873 


NM_003677 


Hs.22393 


density-regulated protein 


1.8 


other 


449587 


W68520 


Hs.331328 


intennediate filament protein synooilin 


5.9 


' other 


450701 


H39960 


Hs.288467 


Homo saptens cDNA FLJ12280 fis. done MA 


5.7 


other 


450703 


AA011202 


Hs.184771 


nucteartectorl/C (CCAAT-tMnding transc 


1.4 


other 
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452461 N78223 
452511 BE408178 
453157 AF077035 
453658 BE541906 
5 100685 AA328229 
100690 M383256 
100833 AF135168 
100850 AA836472 
101161 NM 006262 
10 102481 U50360 

102831 AA262170 
103549 BE270465 
103749 AL135301 
104331 AB040450 
15 104532 AI498763 
104563 AL1 17403 
105032 AA127818 
105039 AA907305 
106531 AA454036 
20 106977 AL043152 
. 107298 N95657 
108717 AA122393 
110018 AW579842 
110330 A1288666 
25 111391 NM„003896 
111392 W46342 
113554 AW503990 
113722 AV653556 
115008 AK001827 
30 115062 AA253314 
115121 A1634549 
117881 AF161470 
119075 M10905 
119615 AL034423 
35 120253 AA1 31376 
125006 BE065136 
127609 X80031 
128868 AA4ig008 
128891 F34856 
40 128959 AI580127 
129209 R62676 
129449 A1098988 
129453 AW974265 
129629 AK000398 
45 129917 M30773 
129922 AF042379 
129989 AB015856 
130182 BE267033 
130365 W56119 
50 130471 AL121438 
130542 U64675 
130586 AB007891 
130768 AF258627 
130992 BE398091 
55 131047 H23230 

131135 NM_016569 
131339 AF058696 
131760 X76732 
131774 BE267158 
60 131853 AI681917 
131881 AW361018 
131887 W17064 
132031 AF193844 
132192 AA206153 
65 132203 NIVLC04782 
132240 AB018324 
132348 AW067708 



Hs. 108106 transcription factor 

Hs.285165 Homo sapiens cONA FLJ20845 fis. done AD 

Hs.31989 DKF2P586G1722 protein 

Hs.87819 Homo sapiens, done MGC:2492. mRNA. comp 

Hs.184582 fibosomal protein L24 . 

Hs.1657 estrogen receptor 1 

H5.108802 N-ethylmaleimide-sensitive factor 

Hs.297939 cathepsinB 

Hs.37044 peripherin • 

gbiHuman caldum. calmodulin-dependentp 

Hs.80917 adaptor-related protein complex 3, sigma 

Hs.78793 protein Idnase C, zeta 

Hs.8768 hypothetical protein FU10849 

Hs.279862 cdk inhibitor p21 binding protein 

Hs.203013 hypothetical protein FLJ12748 

Hs.306189 DKFZP434F1735 protein 

gb:zl12a02.s1 Soares_pregnanLutefiis_NbH 

Hs.36475 ESTs 

Hs.8832 ESTs 

Hs.50421 K1AA0203 gene produd 

Hs.6820 ESTs. Moderately similar to Y0J1.CAEEL H 

Hs.70811 hypothetical protein FU20516 

Hs.104557 hypothetical protein aJ10697 

Hs.16621 DKFZP434I1 16 protein 

Hs.225939 sialyltransferase 9 (CMP-NeuAc:laclosylc 

Hs.325081 Homo sapiens, done IMAGE:3659680. mRNA. 

Hs.142442 HP1-BP74 

Hs.1 84411 albumin 

Hs.87889 helicase-moi 

Hs.154103 LIIVl protein (simitar to rat protein kina 

Hs.88155 ESTs 

Hs.260622 butyrate-induced transcript 1 • 

Hs.287820 fibronectin 1 

Hs.75875 ubiquitin-conjugating enzyme E2 variant 

Hs.326401 fibroblast growth fador 12B 

Hs.145696 spriclngfador(CC1.3) 

Hs.530 collagen, type tV. alpha 3 (Goodpasture 

Hs.106730 chromosome 22 open reading frame 3 

Hs.292457 Homo sapiens, clone M6C:16362. mRNA, com 

Hs.107381 hypothetical protein FLJ11200 

Hs.1 7820 Rho-assodated. coiled-coil containing p 

Hs.1 1 1 554 ADP-ribosylation factor-like 7 

Hs.1 11 632 Lsm3 protein 

Hs.1 1747 hypothetical protein FLJ20391 

Hs.278540 ■ protein phosphatase 3 (fomieriy 2B). reg 

Hs.1 3386 gamma-tubulin complex protein 2 

Hs.247433 - acUvating transcription factor 6 

Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo 

Hs.1 55103 eukaryotic translation initiation factor 

Hs.183706 adducin 1 (alpha) 

Hs.179825 RAN binding protein 2-Iike 1 

Hs.1 6349 KIAA0431 protein 

Hs.21 1 562 ATP-binding cassette, sub-family A (ABC1 

Hs.74316 desmoplakin (DPI, DPIl) 

Hs.22481 ESTs, Moderately similar to A46010 X-Bn 

Hs.267182 TBX3-iso protein 

Hs.25812 Nijmegen breakage syndnjme 1 (nibrin) 

Hs.3164 nudeobindin 2 

Hs.1 69474 DKFZP586J01 19 protein 

Hs.3321 ESTs. Highly similar to IRX1_HUMAN IROQU 

Hs.3383 upstream regulatory element binding prot 

Hs;332848 SWVSNF related, matrix associated, adi 

HS.375B C0P9 complex subunit 7a 

Hs.4209 mitochondrial ribosomal protein L37 

Hs.194714 synaptosomaJ-assodated protein, 29kD 

Hs42676 K1AA0781 protein 

Hs.17031 1 heterogeneous nudear ribonudeoprotein 
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T78735 
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Human 26S proteasome-associated padi 2.2 
ESTs 1.3 
Homo sapiens mRNA for osteoblast specific . 7 .5 
ESTs 7.1 
Protein kinase interferon-inducible double 4.1 
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TABLE 7A 



Table 7 A shows the accession numbers for those pkeys lacking imigenelD's for Table 7. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are Usted in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Ptey CAT number Accession 

102481 31281.-28 U50360 
20 105032 genbanleAA127818 AA127818 

409487 1134778J H19886 AW402806T10231 
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene ^tle 

R1 : Ratio of tumor to nonnai body tissue 

R2: Ratio of tumor to nonnal breast tissue 



Pkey ExAccn UnigenelD Unigene Title 

100075 AF152333 Hs.284160 protocadherin gamma subfamily B. 4 

100229 AV652249 Hs.180107 polymerase (DNA directed), beta 

100262 D38500 Hs.278468 postmeiofc segregation increased 2-like 

100271 BE160081 Hs.256290 8100 calcium-binding protein All (calgiz 

100355 AI907114 Hs.71465 squalene epoxidase 

100522 X51501 Hs.99949 prolactin-induced protein 

100552 AA019521 Hs.301946 lysosomal 

100599 X77343 Hs-334334 transcription factor AP-2 alpha (activat 

100676 X02761 Hs.287820 fibronectin 1 

100690 AA383256 Hs.1657 estrogen receptor 1 

100895 U01351 Hs.75772 nuclear receptor subfamily 3, group C. m 

101046 K01160 NM_002122:Homo sapiens major histocomps 

101086 AA382524 Hs.250959 histatih 1 

101 148 NM_002923 Hs.78944 regulator of G-proteIn signalling 2. 24k 

101161 NM.006262 Hs.37044 peripherin 

101201 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin 

101212 AI186220 Hs.83164 collagen, type XV. alpha 1 

101441 AW468397 Hs.100000 SlOO calcium-binding protein A8 (calgran 

101447 M21305 gb:Human alpha satellite and satellite 3 

101469 AA310162 Hs.169248 cytochrome c 

101567 M33552 Hs.56729 lysosomal 

101600 BE561617 Hs.119192 H2A histone family, member Z 

101624 M55998 gb:Human alpha-1 collagen type I gene, 3 

101674 NM_002291 Hs.82124 laminin, beta 1 

101861 AA350659 Hs.83347 angio-associated, migratory cell protein 

101977 AF1 12213 Hs.184062 putative Rab5-interacting protein 

102193 AL036335 Hs.313 secreted phosphoprotein 1 (osteopontin. 

102199 AA334592 HsJ9914 lumican 

102304 AF015224 Hs,46452 mammaglobin 1 

102345 NM_003480 Hs.300946 Microfibril-associatedglycoprotein-2 

102457 NM_001394 Hs.2359 dual spedficity phosphatase 4 

102534 U96759 Hs.198307 von Hippel-Lindau binding protein 1 

102541 AI379954 Hs.79025 KIAA0096 protein 

102827 BE244588 Hs.6456 chaperonin containing TCPl.subunit 2 (b. 

102962 R50032 Hs.159253 collagen, type Vl, alpha 2 

102991 AW293542 Hs.75309 eukaryotk: translation etongation factor . 

103119 X63629 Hs.2877 cadherin 3. type I.P-cadherin (placenta 

103175 X69089 Hs.79227 myomesin (M-prolein) 2 (165kD) 

103286 D38616 Hs.54941 phosphorylase kinase, alpha 2 (fiver) 

103319 X83492 Hs.82359 tumor necrosis factor receptor superfami 

103372 BE536700 Hs.4888 seryl-tRNA synthetase 

103419 T34708 Hs.272927 Sec23 (S. cerevisiae) homolog A 

103471 Y00815 Hs.75216 protein tyrosine phosphatase, receptor t 

103546 Z14244 Hs75752 cytochrome c oxidase subunitVllb 
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103658 

103758 

103774 

103821 

103869 

103980 

104054 

104115 

104189 

104230 

104278 

104295 

104319 

104425 

104432 

104464 

104479 

104505 

104592 

104613 

104636 

104782 

104792 

104848 

104849 

104850 

104852 

104861 

104873 

104891 

104920 

104926 

104952 

104963 

104977 

105030 

105035 

105068 

105159 

105178 

105182 

105274 

105303 

105413 

105426 

105432 

105443 

105483 

105492 

105495 

105539 

105594 

105623 

105807 

105812 

105823 

105831 

105851 

105879 

105918 

105939 

105941 

105969 

105990 

106012 

106050 

106070 



NM.000088 

AA084874 

H24185 

AA095971 

BE439604 

AW1 30242 

AK001913^ 

AF1 83810 

AB040927 

AB002347 

AW583693 

AW365522 

AW804296 

AF283775 

X51501 

AW966728. 

AK001731 

W94824 

AW630488 

AF123303 

R82252 

AW270555 

AA960961 

AA305351 

A1279065 

AL133035 

W70164 

AA058630 

W03831 

W44626 

AW955089 

BE298808 

AW076098 

AB029020 

AI392640 

BE613061 

N39760 

BE410438 

AF146277 

AA313825 

BE407961 

AI554929 

BE243327 

AI015709 

W20027 

W03516 

AA252372 

AL137257 

AI805717 

AU037715 

AB040884 

AB024334 

BE504200 

AA788g46 

BE614149 

AI559444 

AA329449 

AI827976 

BE392914 

AW028485 

AL137728 

AB033075 

A6030656 

AI690586 

AI240665. 

NM^001329 

T74445 



Hs.172928 

Hs.92918 
Hs.198793 
Hs.24322 
Hs.293476 
Hs.7100 
Hs.26102 
Hs.301804 
Hs.15303 
Hs.109253 
Hs,103657 
Hs.9950 
Hs.35380 
Hs.99949 
Hs.54642 
Hs.i06390 
Hs.11565 
Hs.325820 
Hs.24713 
Hs.106106 
Hs.171774 
Hs.305953 
Hs.274369 
Hs.241507 
Hs.8728 
Hs.20107 
Hs.29759 
Hs.20597 
Hs.30627 
Hs.306083 
Hs,33363 
Hs74316 
Hs.173694 
Hs.18272 
Hs.337772 
Hs.6859 
HS.90D6 
Hs.265551 
Hs.21941 
Hs.18271 
Hs.281866 
Hs.182626 
Hs-172089 
Hs.23439 
Hs.76698 
Hs.12144 
Hs^3458 
Hs.289112 
Hs.28785 
Hs.109694 
Hs.25001 
Hs,30127 
Hs.16869 
. Hs.20814 
Hs.293960 
Hs.247302 
Hs.24391 
Hs.30503 
Hs^6136 
Hs.12258 
Hs.10669 
Hs.17377 
Hs.29403 
Hs,8895 
Hs.171391 
Hs.5957 



ooQagen, type 1, alpha 1 . 3^ 3 

gbzn13e04.r1 Stiatagene hNT neuron (937 0.9 10 

hypothetical protein 1.9 15.9 

Homo sapiens cONA:FLJ22463Rs, done H 1.2 3.9 

ATPase, H-i* transporting, lysosomal (vacu 1 .4 3.9 

hypothetical protein FKSG44 1 .6 4.1 

hypothetical protein 1.5 4.3 

opposite strand to tiichoihinophalangeal 7 7 

KIAA1494 protein 2 4.6 

KIAA0349 protein 0.7 4.5 

N-tenninal acetyftransferase complex ard 3.3 3.3 

hypothetical protein PR02219 2.3 4.2 . 

Sec61 gamma 3.1 7 

X 001 protein 4 1.3 * 

prolactin-induced protein 3.8 0.6 

methionine adenosyitransferase 11, beta 0.8 6.7 

Homo sapiens mRNA: cDNA DKFZp586H0924 (f 1.7 4.8 

RlKENcDNA 2010100012 gene 2 7.5 

protease, serine, 23 1.9 7.4 

hypothetical protein 1.1 6.3 

protein kinase (cAMP-dependent. catalyti 1.2 4 

hypothetical protein 1.4 3.9 

zinc finger protein 83 (HPF1) 1.5 4.2 

uncharacterized hypothalamus protein MAR 1.1 4.1 

ribosomal protein S6 1.3 4.6 

hypothetical protein DKFZp434G171 1.2 3.6 

ESTs 0.8 4.2 

RNA POLYMERASE I AND TRANSCRIPT RELEASE 17 5.1 

host cell factor homotog 0.8 5.4 

ESTs 0.7 6.8 

Novel human gene mapping to chomosome 22 1 3.9 

DKFZP434N093 protein 3.3 3.3 

desmoplakin (DPI. DPIl) 1.2 3.7 

KIAA1097 protein 1.1 5.5 

amino acid transporter system A1 3.2 1 .4 

Homo sapiens, Similar to RlKENcDNA 0610 1.6 11.4 

Homo sapiens. Similar to RIKEN cDN A 5830 1 .5 7.2 

VAMP (vesicie-associated membrane protet 1.1 3.5 

C02-associated protein 1.2 10 

AD036 protein 3.6 8,3 

golgi phosphoprotein 3 1.7 6.8 

ATPase, H+ transporting , lysosomal (vacu 1 . 1 3.7 

chromosome 22 open reading frame 5 1.5 4 

Homo sapiens mRNA; cDNA DKFZp586l2022 (f 1.5 14 

ESTs 4.3 2.9 

stress^ssodated endoplasmic reticulum 1.5 5 

KIAA1033 protein 1.2 3.6 

Homo sapiens cDNA:FU23015ris. clone L 17 . 15.8 

CGM3 protein 2 4.8 

mioofibrillar-associated protein 3 1.3 3.9 

KIAA1451 protein 27 11.4 

tyrosine 3^onboxygenaseftiyptophan 5-mo 1.3 6.1 

hypothetical protein 1.7 4.5 

ESTs. Moderately similar to CA1C RAT COL 3.9 24.6 

CGI-27 protein 1.8 3.6 

ESTs 1.9 6.6 

twisted gastrulation 1.5 4.3 

hypothetical protein FLJ13612 3.8 1.9 

Homo sapiens cDNAFLJ1 1344 fis, done PL 17 4 

hypothetical protein MGC14156 1.7 7.4 

Homo sapiens mRNA; cDN A DKFZp434B0920 (f 1 .2 3.8 

development and differentiation enhandn 1.3 4.6 

coronin, actin-binding protein, 1C 1.1 5.9 

hypothetical protein FU22060 2 4.6 

ESTs 4.1 1.2 

C-terminal binding protein 2 2.6 7 

Homo sapiens done 24416 mRNA sequence 1.4 10.7 
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106083 


H62087 


ns.jioos uiyrotu normone reoepior*associaicQ proi 


1.0 


O.O 


106155 


AA425414 


nS. OOZO / n UCi6di laCiOr l/D 


<; A. 
o.*» 




106255 


BE613206 


ns.2/9oU/ ca!p3Siaun 


\ ft 


A 
*» 


106414 


BE568205 


ns.28827 mitogen-acQvated protBin Kinase Kinase 


5.1 


0.1 


106538 AK00D274 


nS.27oo35 rtUuiVlAlor protein 


i 0 


<t 0 

0.9 


106568 AW051564 


ns.282o5 patcned related protein translocated in 


1.0 


0.4 


106574 


BE044325 


ns.2272o0 Uo snKNA-associa»d onHixe protein 






106813 


N88604 


Hs.30212 thyroid receptor inteiacting protein 15 


1.2 


O.D 


106617 


H09548 


ns.53D7 cSTs, WeaKiy similar to 138022 nypotneti 


0.9 


A A 


106619 AA459480 


ns.23956 nypotneticai protein ruzuouz 


A 1 


o.o 


106701 


BE387614 


Hs.25797 spfidng factor 3b, .subunit 4, 49kD 


1.6 


■7.3 


108721 


AA741038 


Hs.6670 ESTs 




0.1 


106776 


AA206O79 


ns.6693 nypotneticai protein rLJ204zu 


A 

1 


C A 
0.** 


106866 


AA487416 


Hs.268231 Homo sapiens cDNA: FU231 11 fis, clone L 


1.6 


5.4 


106868 


BE185536 


Hs.301183 molecule possessing ankyrin repeats indu 


3.3 


1.2 


108887 


BE503373 


Hs,334335 hypothetical protein FU 13576 


1.4 


6,3 


106940 


T85594 


Hs.339808 hypothetical protein FU10120 


3.3 


1.8 


106968 


AF216751 


Hs.26813 CDA14 


3 


3 


107052 


BE391904 


Hs. 12482 glyceronephosphateO-acyltransferase 


1.7 


7.0 


107051 


BE147611 


Hs.6354 stromal cell derived factor receptor 1 


1.2 


4.3 


107149 


AI289507 


Hs.299883 hypothetical protein FU23399 


1.8 




107222 


BE172058 


Hs.82689 tumor rejection antigen (gp96) 1 


1.2 


6.9 


107233 


BE267795 


Hs.22595 hypothetical protein FU10637 


1.4 


3.5 


107295 


AA186629 


Hs.801 20 UDP-N-aoetyt-alpha-D-^alactDsamine:polyp 


2.6 


4.3 


107679 


AA011510 


Hs.60512 ESTs 


1.8 


4 


107914 


AA027229 


Hs.6i329 ESTs. Weakly similar to T16370 hypotheti 


1,3 


3.5 


107965 


AF109219 


Hs.108787 phosphatidytinositol giycan. class N 


1,6 


3.0 


108033 


AW368993 


Hs.323748 Homo sapiens clone CDABP0086 mRNA sequen 


1.8 


108060 


AA291440 


Hs,731 49 paired box gene 8 


1.1 


3,5 


108081 


AA093668 


Hs.28578 muscleblind (DrosophilaHiKe 


0.7 


0.0 


108137 


A1283611 


Hs.263479 ESTs. Weakly similar to HMGI^HUMAN HIGH 


A O 


108186 


AW088579 


Hs.7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 


3.1 


108215 


AI879238 


Hs.299315 collapsln response mediator piotein-5;C 


1.5 


4.6 


108297 


AA333660 


Hs.71331 hypothetical protein MGC5350 


1.5 


4 


108339 


AW151340 


Hs.51615 ESTs. Weakly similar to ALU7J1UMAN ALUS 


6.3 


108371 


AA074374 


Hs.67639 ESTs 


1.3 


3.8 


108399 


AP086070 


Hs.237519 EST 


1 


3.6 


108469 


AA079487 


gb:zm97f08.s1 Stratagene colon HT29 (937 


1.5 


3.6 


108470 


AA079500 


gb:zm96h10.s1 Stratagene coton HT29 (937 


1.1 


•4.3 


108564 


M23114 


Hs. 1 526 ATPase, Ca-i-i- transporting , cardiac musci 


2 


4.9 


108641 


AA1 12059 


Hs.429 ATP synthase. H*»- transporting, nutochond 


1.1 


3,5 


108668 


AA058522 


Hs.185751 ESTs 


1.2 


3,6 


108694 


AA036725 


Hs.61847 ESTs 


1.4 


. 3,6 


108824 


AK001332 


Hs.44672 hypothetical protein FU10470 


1.4 


3.5 


108863 


AA1 33456 


Hs. 1 02548 glucocorticoid receptor ONA binding fact 


1.2 


4 


108893 


BE276891 


Hs.194691 retinoic acid induced 3 


1.3 


3.6 


108992 


AA152312 


Hs.72047 ESTs 


1.1 


4.1 


109072 


AI732585 


Hs.22394 hypotheUcal protein FU10893 


1.2 


3.5 


109097 


AA167512 


gb:zp10f12.s1 Stratagene fetal retina 93 


1.3 


5 


109160 


BE220601 


Hs.301997 hypothetical protein FU13d33 


4 


C A 

6.1 


109244 


BE179030 


Hs.64239 Human DNA sequence from clone RP5-1 174N9 


1.7 


109481 


AA878923 


Hs.289069 hypothetical protein FU21016 


3.8 


7.7. 


109484 


AA365263 


Hs.72531 hypothetical protein FU11838 


1.9 


A 
H 


109795 


AA173942 


Hs.326416 Homo sapiens mRNA; cONA DKF2p564H1916 (f 


3.7 


110009 


BE075297 


Hs.6614 ESTs. Weakly sinular to A43932 mucin 2 p 


4.6 


7.4 


110107 


AW151660 


Hs.31444 ESTs 


1.2 


:l5 


110411 


AW001579 


Hs.9645 . Homo sapiens mRNA for KIAA1741 protein. 


3.7 


■--3 


110731 


NM.014899 


Hs.188006 KIAA0878 protein 


2.8 


\ 7 


110756 


N21207 


Hs.182999 ESTs 


1.6 - 


3.0 


110930 


BE242691 


Hs.14947 ESTs 


3.1 


1.2 


110935 


AI753230 


Hs.323562 hypothetical protein DKrZp564K142 


1.9 


7.5 


111051 


Ai681293 


Hs.12186 hypothetical protein RJ22558 


2 


4 . 


111110 AK001566 


Hs.23618 hypothefical protein FU10704 


1.1 


3.8 


111356 


BE301871 


Hs.4867 mannosyl (alpha-1 .3-)-glycoprotein beta- 


1 


8.2 


111357 


BE314949 


Hs.871 28 hypothetical proten FU23309 


3.3 


6.1 


111770 R27975 


Hs.269401 ESTs. Moderately s^mOar to S65657 ai^ha . 


1.2 


5.4 


111900 AF131784 


Hs.25318 Homo sapiens done 25194 mRNA sequence 3.2 


0.8 



8.1 



5.6 
6.9 



4.7 



7.4 
1.3 
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111903 


NM 014906 


Hs.1 66351 KIAA1072 protein 


1 


5.4 


111951 


NM_014927 


Hs.100527 KIAA0902 protein 


1 


3.8 


112141 


AW137198 


Hs.278682 Phosphatidylgtycerophosphate Synthase 


1.4 


3.5 


.112193 


R49499 


Hs.138238 ESTs 


1,5 


3.6 


112197 


NfVL003655 


Hs.5637 ESTs 


4.6 


2 


112610 


AW500106 


Hs.23643 serine/threonine protein kinase MASK 


3.3- 


10.5 


112971 


Z42387 


Hs.83883 transmembrane, prostate androgen induced 


3.2 


3 


112984 


T16971 


Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 


3.7 


10.8 


113056 


AF019226 


Hs.8036 glioblastoma overexpressed 


4.5 


3.7 


113449 


AW160683 


Hs.158006 hypothetical protein 


1.2 . 


4.4 


113497 


AF143321 


Hs.15572 hypothetical protein IMAGE 109914 


0.9 


3.6 


113508 


AL042936 


Hs.21 1571 holocytochrome c synthase (cytochrome c 


11 


3.5 


113531 


AK001898 


Hs.16740 hypothetical protein FU1 1036 


1.2 


3.9 


113604 


AI075407 


Hs.296083 ESTs. Moderately similar to 154374 gene 


1.7 


5.3 


113674 NIVL014214 


Hs.5753 inositol(myo}-1 (or 4Hnonophosphatase 2 


0.8 


6.1 


113841 


W30681 


Hs.146233 Homo sapiens cDNA:FLJ22130fis.ck)neH 


1.7 


6.2 


113857 


AW243158 


Hs.5297 DKF2P564A2416 protein . 


1.2 


4.6 


113931 


BE2^99 


Hs.34g6 hypothetical protein MGC15749 


1.5 


4 


113936 


W17056 


Hs.83623 nuclear roceptor subfamDy 1. group 1. m 


3.8 


1 


113987 


AA345519 


Hs.g64 1 complement component 1 , q subcsomponent, 


1.2 


4.7 


114132 


AI342493 


Hs.24192 Homo sapiens cONA FU20767 fis. done CO 


0.3 


4.3 


114156 


BE179882 


Hs.336920 glutathione peroxidase 3 (plasma) 


1.1 


4.3 


114213 


N58309 


Hs.19575 CGI-11 protein 


1.6 


9.2 


114636 


AA075488 


.gb:zm88d01.s1 Stratagene ovarian cancer ' 


1.6 


3.7 


114760 


A1929382 


Hs.252692 hypothetical protein FU20343 


1,4 


4 


114781 


T10446 


Hs.95388 ESTs 


1 


4.3 


'114795 


AB037858 


Hs.173484 hypothetical protein FU10337 


1.6 


9.2 


114901 


AV660012 


Hs.196437 hypothetical protein FU10788 


1.4 


5.2 


115096 


AI683G69 


Hs.175319 ESTs 


3.7 


1 


115518 


BE541042 


Hs.23240 Homo sapiens cDNA: FU21848 fis. clone H 


3.2 


4.2 


115646 


N36110 


Hs.305971 solute earner family 2 (facilitated glu 


1.5 


3.9 


115764 


AW582256 


Hs.9101 1 anterior gradient 2 (Xenepus laevis) horn 


1.3 


5.9 


115802 


AW410233 


Hs.206521 YME1 (S.cerevisiae)-nke 1 


1.7 


6.6 


115994 


AB037836 


Hs.109315 KIAA1415 protein 


1.5 


9.1 


116032 


BE383668 


Hs.42484 hypothetical protein FU10618 


0.9 


4.3 


116046 


BE395293 


Hs.94491 hypothetical protein FU20297 


1.6 


5.5 


116274 AI129767 


Hs.1 82874 guanine nucleotide binding protein (G pr 


3.2 


2.4 


116310 


Z24854 


Hs,42299 ESTs 


0.8 


4.7 


116356 


AI371223 


Hs.288671 Homo sapiens cDNA FU 1 1997 fis. done HE 


2.4 


3.9 


116429 


AF191018 


Hs^9923 putative nudeofide binding protein, est 


5.5 


5.5 


116461 


AA313607 


Hs.58633 Homo sapiens cDNA: FLJ22145 fis, clone H 


5 


1.3 


116470 


AI272141 


Hs.83484 SRY (sex detemiining region Y)-box 4 


8.7 


4.5 


116578 


D21262 


H5.75337 nudeolarand ooiled-body phosphprotein 


3.2 


6.9 


116579 


AW888411 


Hs.8191 5 leukemia-assodated phosphoprotein p18 ( 


3.2 


3 


116589 


AI557212 


Hs.17132 ESTs, Moderately simiiar to 154374 gene 


3.1 


8.3 


116786 


H25836 


Hs.301 527 ESTs. Moderately similar to unknown (H.s 


3.2 


4.5 


117170 


N25929 


Hs.42500 ADP-ribosylation factor-like 5 


7 


5,5 


117215 


N20066 


Hs.133207 PTPRF Interacting protein, binding prate 


1.2 


6.2 


117280 


M18217 


Hs.172129 Homo sapiens cDNA: FU21409 fis. done C 


4.5 


2.4 


117576 


AI383467 


Hs.44597 ESTs 


1.4 


4:2 


117667 


U59305 


Hs.44708 Ser-Thr protein kinase related to the my . 


4.3 


0.5 


117881 


AF161470 


Hs.260622 butyrafe-induced transcript 1 


2.1 


5.7 


118336 


BE327311 


Hs.47166 HT021 


3.6 


7.7 


118475 


N66845 


gb:za46c1 1.s1 Scares fetal liver spleen 


4.2 


0.5 


118493 


AL353944 


Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 


3.5 


118505 


N67343 


gb:yz50b07.s1 Morton Fetal Cochlea Homo 


2.1 


3.8 


119159 


AF142419 


Hs.15020 homotog of mouse quaking OKI (KH domain 


3.7 


1.5 


119307 


BE048061 


Hs.37054 ephrin-A3 


3 


1,1 


119355 


BE218319 


Hs.5807 GTPase Rab14 


1.1 


5.6 


119462 


BE041667 


Hs.314544 Homo sapiens cenncal cancer suppressor- 


1.4 


4.3 


119771 


AI9056B7 


Hs.2533 EST 


3.2 


1 


119940 


AL050097 


Hs^2531 DKFZP586B0319 protein* 


4.3 


0.7 


119943 


BE565849 


Hs.14158 copinelll 


3.5 


1.9 


120407 AA235207 


Hs.250456 hypothetical protein DKFZp762F2011 


1.5 


3.7 


120493 AW968080 


HS.1S2939 HonK) sapiens done 24630 mRNA sequence 


4 


1.4 


120S77 AF150208 


Hs.108327 damage-spedfic DMA binding protem 1 (1 


1.6 


6.8 


120867 


AA350781 


Hs56967 ESTs 


1.1 


3.6 



3.3 
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124059 BE387335 
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124543 AI393320 
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124605 AA749315 
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124737 BE270465 
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124792 R44357 
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125034 BE548446 
45 125058 T83731 
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55 125260 H05635 
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125388 W27235 
125824 Z45258 
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125970 AW504721 
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126469 BE384361 
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Hs,178292 KIAA0180 protein 
Hs.249495 heterogeneous nudear ribonucleoprotsn 
Hs.104800 hypothetical protein FU10134 
Hs.27413 adaptor protein containing pH domain. FT 
Hs.21355 hypothetical protein DKFZp762K2015 
Hs.308026 major histocompatibility complex, dass 
Hs.284291 sorting nexin 6 
Hs.218329 hypothelical protein 
Hs,178443 ESTs 

Hs.106747 serine carboxypeptidasel precursor prot 
gb:ae55d04.s1 Stratagene lung cardnoma 
gb:ae56h07.s1 Stratagene lung cardnoma 
ESTs, Moderately similar to H2BL_HUMAN H 1.1 
DKFZP434B203 protein 
cydic AMP phosphoprotein, 19 kD 
activated RNA polymerase It transcripiio 
nudear factor I/A 

ESTs, Weakly similar to S64054 hypotheti 
hypothetical protein FU22174 
lectin, galadoside-blnding. soluble, 1 
erythrocyte membrane protein band 7.2 (s 
Homo sapiens. Similar to RIKEN cDNA 9430 
GTP-binding protein 
KIAA0265 protein 

ESTs, Weakly similar to ALUC_HUMAN 111! 
g b:yy98e1 2.s1 Soares_multipIe_sderosis_ 
cell division cyde 42 (GTP-binding prot 
ESTs 
ESTs 

A kinase (PRKA) anchor protein 2 
minichromosonie maintenance dericient (S. 
DKFZP586C1324 protein 
Homo sapiens cDNA FU11918 fis. done HE 1.5 
protein kinase C, zeta 0.7 
ESTs. Moderately .similar to ALU5_HUMAN A 0,9 



HS.18B762 
Hs.21610 
Hs.7351 
Hs.74861 
Hs.173933 
Hs.283713 
Hs.7734 
Hs.227751 
Hs.160483 
Hs.334725 
Hs.215766 
Hs.192966 
Hs.268869 

Hs.146409 
Hs.104573 
Hs.108275 
Hs.42322 
Hs.77171 
Hs.21143 
Hs.289068 
Hs.78793 
Hs.91052 
Hs.76288 
Hs.48712 



1.1 
1.4 
1.9 
1.5 
14,8 
1.2 
2.5 
1 

1.5 
1.8 
1.1 
1.3 
2.7 
2.1 
1 

1.4 
0.7 
1.1 
1.4 



Hs.103804 

Hs.33578d 

Hs.194766 

Hs.1 10044 

Hs.187775 

Hs.5167 

Hs.3343 



Hs,122730 

Hs.143669 
Hs.274428 
Hs.8518 
Hs.12396 
Hs.84298 
Hs.294030 
Hs.171957 
Hs.180224 
Hs.64311 
Hs.288013 
H5.76550 
Hs.177516 
Hs.30376 
Hs.182885 
Hs.334762 
Hs.10248 
Hs.94830 
Hs.150101 



calpain 2. (m/11) large subunit 1 .3 

hypothetical protein FU20736 1.8 
gb:yg93h09.s1 Scares infant brain 1N1B H 1 
heterogeneous nudear ribonudeoprotein 3.2 
tubulin, beta polypeptide 1 
seizure related gene 6 (mouse)-like 0.9 
ESTs 0.9 
ESTs 1.2 
Homo sapiens mRNA; cDNA DKFZp434F152 {fr 
phosphoglycerate dehydrogenase 0.9 
gb:oq02h08.s1 NCLCGAP_Lu5 Homo sapiens 
gb:ye20«}5.s1 Stratagene lung (93721 0) H 3.2 
ESTs. Moderately similar to KIAA1215 pro 5.3 
gb'.zc78a07.s1 Pancreatic Islet Homo sapi 0.9 
ESTs 1.1 
TRF2-interacUng telomeric RAP1 protein 1.1 
selenoprotein N 1.1 
ESTs. Weakly similar to 2004399Achromos 1 
CD74 antigeri (invariant polypeptide of m 1 .2 
topoisomerase-related function protein 4 1 
triple functional doniain (PTPRF friteract 1.3 
myosin regulatory light chain 1.1 
a disintegrin and metaUoproteinase doma 1.4 
short ooSed-coii protein 2.4 
Homo sapiens mRNA; cONA DKFZp564B1264 (f 
high density lipoprotein binding protein 1 .9 
hypothetical protein 14 
ESTs. Weakly similar to JC5024 UDP-galac 2 
hypothetical protein FLJ14735 1.3 
Homo sapiens cDNA FU20167 fis. done CO 1.3 
ESTs. Moderately sintilar to T03094 A-Wn 4.3 
lysosomal 1-5 

181 



4.1 

5.5 

3.2 

3.9 

7.1 

3,7 

4.9 

3.6 

5.2 

5.1 

5.2 

9.3 

3,6 

3.5 

3.8 

11.2 

4.4 

11.5 
6.2 

12,7 

4,1 

8,4 

10.2 

4.8 

4-1 

4.3 

5.7 

4.1 

4 

4 

3.5 
3.6 
9.9 
4 

3,6 
3.9 
4.2 
3.6 
3.4 
4.4 
5.2 
3.5 
5 

1.5 
6 

1 3.7 

2.5 

6.6 

6.1 

4.3 

4.1 

3.8 

3.6 

7.8 

4.9 

4.8 

16.1 

5.3 

8.7 

1.8 

3.8 

4.1 

3.7 

4.1 

5 

0.9 
4.7 



3.7 



4.6 



wo 02/059377 



PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



128453 


X02761 


Hs.287820 fibfonectin 1 


1.2 


4.3 


128460 


T16206 


Hs.237164 ESTs, Highly similar to LDHK HUMAN L-LAC 3.1 


44.4 


128491 


H08379 


Hs.165563 hypothetical protein DKFZp434N1429 


0.6 


13.1 


128495 


NM 005904 


Hs.100602 MAO (mothers against decap8ntapiegic.Dr 


1.3 


4 


128546 


NM 003478 


Hs.101299 cuDin 5 , 


1 


5.1 


128574 


AI185977 


Hs.38260 ublquHin specific protease 18 


0.8 


4 


128611 


NM_014721 


Hs.102471 KIAA0680 gene product 


1.3 


3.7 


128652 


AA432202 


Hs.103147 hypothetical protein FU21347 


1.4 


3.9 


128653 


D87432 


Hs. 1 03 15 sciuie earner family 7 (cationic ammo 


1.2 


3.6 


128655 


Ai246669 


Hs.324275 WW domain-containing protein 1 


0.8 


4.1 


128684 


BE246444 


Hs.283685 hypothetical protein FU20396 


3 


1.6 


128717 


AK001564 


Hs.1 04222 hypothetical protein FUlb702 


2.8 


4.8 


128774 


AA476220 


Hs.54457 CD81 antigen (target of antiproliferativ 


i!i 


10.6 


128790 


AF026692 


Hs. 105700 secreted frizzled-related protein 4 


1 


3,8 


128805 


AA194554 


Hs.183434 ATPase, H+ transporting, lysosomal (vacu 


5.3 


5.3 


128827 


AI638184 


Hs.106334 Homo sapiens done 23836 mRNA sequence 


2.2 


5.3 


128840 


AI917602 


Hs.106440 ESTs 


1 


4.5 


128869 


AA768242 


H5.8O8I8 hypothetical protein 


0.8 


3.6 
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060985 


Hs.106909 OKFZP5660193 protein 


4.6 


3.7 
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AI222020 


Hs.182364 CocoaCrIsp 


3 


1.5 
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AK000140 


Hs.107139 hypothetical protein 


0.2 


3.9 
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.AA622037 


Hs.166468 programmed ceO death 5 


2.5 


15.2 


128926 


AF1550g6 


Hs.107213 hypothetical protein FU20585 


4 


4 
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Hs.10724 MDS023 protein 


1.2 


4.5 
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AW247536 


Hs.10729 hypothetical protein 


1.4 


5 


128948 


AW953622 


Hs.223025 RAB31 , member RAS oncogene family 


2.3 


5.6 


128953 


AB020716 


Hs.107352 KIAA0909 protein 


0.9 


3.9 


128979 


AW271217 


Hs.281434 Homo sapiens cDNA FLJ14028 fis. done HE 


1.5 


3.6 


128980 


AA258924 


Hs,10758 NM_002495*:Homo sapiens NADH dehydrogena 


0.8 


129005 


AI770025 


Hs.13323 hypothetical protein FU22059 


1:2 


5.7 


129009 


C15105 


Hs,330716 Homo sapiens cONAFLJ14368fis. done HE 2.1 


9.9 


129013 


AA371156 


Hs.107942 DKFZP564M1 12 protein 


2.4 


3.8 


129068 


AI634522 


Hs.152925 KIAA1268 protein 


1.2 


3.8 


129106 


AW504486 


Hs.10668g sterol regulatory element binding transc 


1.2 


5.5 


129113 


BE543205 


Hs.288771 DKFZP586A0522 protein 


0.5 


3.7 


129125 


AB002450 


Hs.278391 CGl-109 protein 


1 


5.2 


129125 


AW881089 


Hs.108806 Homo sapiens mRNA; cDNA OKFZp566M0947 (f 


1.5 


129151 


N23018 


-Hs. 171 391 C-temninal binding protein 2 


2.1 


9.7 


129230 


AA335362 


Hs.109646 Empirically selected from AFFX single pr 


0.9 


8.6 


129234 


M18916 


Hs.282997 glucosidase. beta; add (Indudes glucos 


1.1 


3.5 


129238 


BE542214 


Hs.109697 ESTs 


1.1 


12.8 


129239 


W57656 


Hs.109701 ubk)uitin-like5 


3.2 


5.1 


129241 


Aia78857 


Hs.1 09706 hematological and neurological expressed 


1.9 


5.7 


129243 


BE169531 


Hs.109727 TAKI-binding protein 2; K1AA0733 protein 


1.2 


6.6 


129247 


R49920 


Hs.109733 CGM31 prot»n 


1.5 


3.5 


129250 


AA344367 


Hs. 109760 Empirically selected from multiple AFFX 


1 


5.4 


129258 


AA250970 


Hs.251946 poly(A>-binding protein, (^oplasmic 1-1 


1.3 


4:1 


129260 


AF077200 


Hs.279813 hypothetical protein 


1.6 


3.9 


129270 


AA357185 


Hs.l 09918 ras homolog gene family, member H 


1.8 


4:2 


129277 


AB007896 


Hs. 1 10 putative L-type neutral amino add trans 


1.1 


6.1 


129284 


AA318224 


Hs.296141 ESTs 


2.5 


4.8 


129300 


W94197 


Hs.1 10165 ribosomal protein L26 homolog 


1.6 


5.1 


129318 


AF189062 


Hs.285976 tumor metastasis-suppressor 


1.8 


6.5 


129352 


AW511656 


Hs.1 70177 Meisi (mouse) homolog 


0.9 


4 


129362 


U30246 


Hs.1 10736 solute camer family 1 2 (sodium/potassi 


1.4 


9.2 


129379 


BE278964 


Hs.1 1085 CGI-1 11 protein 


1 


4.8 


129390 


AA318271 


Hs.250905 hypothetical protein 


1 


4.1 


129416 


AA016188 


Hs. 1 1 1 244 hypothetical protein 


1.8 


10.7 


129427 


AI498631 


Hs.l 1 1 334 fem'tin, light polypeptide 


1.1 


4.8 


129470 


W92931. 


Hs.250899 heat shock factor binding protein 1 


1.8 


9.3 


129472 


AL050260 


Hs.323817 DKFZP547E1010 protein 


1 


5 


129475 


NM_004477 


Hs^772 FSHO region gene 1 


1.1 


4.2 


129498 M449789 


Hs.7551 1 connective tissue growth factor 


1.9 


6.8 


129501 


AI631811 


Hs.180403 STRINprot^ 


1.1 


9.7 


129527 AA769221 


Hs^0847 delta^bulin 


1.1 


4.3 


129545 


R18087 


Hs^23769 dsplatin resistance related protein CRR 


1 


4.2 


129579 AW517695 


Hs^86218 Junctional adhesion molecule 1 


2.3 


3.5 



3.8 
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129606 AW968941 Hs.1 66254 hypothetical protein DKF2p566l1 33 • 

129619 AA209534 Hs.284243 tetraspari NET-6 protein 

129620 D79338 Hs.239720 CCR4-N0T transcription complex, subunit 

129621 AL1 10212 Hs.301005 purine-rich element binding protein B 
5 129634 AB020335 Hs. 181 300 seM (suppressor of lin-1 2, C.elegans)- 

129663 AI207408 Hs. 11866 translocase of inner mitodiondrial membr 

129679 AW889132 Hs-11916 nljokinase 

129888 U53209 Hs.24937 transfomier-2 alpha (htra-2 alpha) 

129691 M26939 Hs.1 19571 collagen, type HI, alpha 1 (Ehlers-DanI 

10 129712 U46386 Hs.12102 sorting nexin 3 

129747 AL050272 Hs.12305 DKFZP566B1 83 protein 

129788 BE397454 Hs.1 24969 Homo sapiens clone 24707 mRNA sequence 

129796 BE218319 Hs.5807 GTPaseRab14 

129797 M62839 Hs.1252 apoiipopiotein H (beta-2'glycopn>tein I) 
15 129800 AF052112 Hs.12540 lysosomal 

129834 AL080084 Hs.296155 CGMOO protein 

129836 AW410233 Hs.206521 YIVJE1 (S.cerevisiaeHike 1 

129843 NWL014840 Hs.200598 KIAA0537 gene product 

129874 AA626937 Hs.181551 hypothetical protein MGC2594 

20 129878 Z43161 Hs-283714 30 kDa protein 

129904 AL119499 Hs.13285 neuronal potassium channel alpha subunit 

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 

129975 X14008 Hs.234734 lysosomal 

1 29982 Z14221 gb:H.sapiens gennline transcript of Ig h 

25 130007 R15917 Hs.142570 Homo sapiens done 24629 mRNA sequence 

130060 BE277024 Hs.1 46381 RNA binding motif protein, X chromosome . 
130064 X57815:comp Empirically selected from AFFX single pr 

130068 M93143 Hs.262869 plasminogen-like 

130090 H97878 Hs.1 32390 zinc finger protein 36 (KOX 18) 

30 130095 AK001635 Hs.14838 hypothetical protein FU 10773 . 

130102 W61005 Hs.14896 DHHC1 protein 

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 

130115 T47294 Hs,149923 X-box binding protein 1 

130123 NM_005095 Hs.15O390 zinc finger protein 262 

35 130150 BE094848 Hs.15113 homogentisate 1,2-dioxygenase(homogenti 

130161 R42678 Hs.151385 KIAA0564 protein 

130210 M23115 Hs.1526 ATPase, Ca-*^ transporting, cardiac musd 

130213 BE278370 Hs.i5265 heterogeneous nuclear ribonudeoprotein 

130215 BE301883 Hs.152707 glioblastoma amplified sequence 

40 130232 U29463 gb:Human cytochrome b561 gen 

130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transpose 

130281 W78907 Hs.15395 similar to aiginyl-tRNA synthetase (aigi 

130343 AB040914 Hs.278628 KIAA1481 protein 

130385 AW067800 Hs,155223 stanniocalcin 2 

45 130414 AW842182 Hs.241392 small indudble cytokine A5 (RANTES) 

130417 AW163518 Hs.155485 huntingtin interacting protein 2 . 

130440 AA852868 Hs,132853 KIAA0171 gene product 

130442 NM_005245 Hs.1 18244 protein phosphatase 2. regulatory subuni 

130465 AW362955 Hs,15641 Homo sapiens cDNA FLJ14415 fis, done HI 

50 130479 R44163 Hs.12457 hypothetical protein FU 1081 4 

130499 AB007915 Hs.158286 K1AA0446 gene product 
130546 AI598022 Hs.193989 TAR DNA binding protein 
130568 AA232119 Hs.16085 putative G-protein coupled receptor 
130606 A1652143 Hs.288382 hypothetical protein FU13111 

55 130612 BE242873 Hs.16677 WD repeat domain 15 

130616 AL049963 ' Hs,284205 up-regulated by BCG-CWS 
130623 AL045128 Hs.1691 glucan (1.4-alpha-). branching enzyme 1 . 
130629 AL042896 Hs.1697 ATPase, H+ transporting, lysosomal (vacu 
130632 AW073971 Hs.238954 ESTs, Weakly similar to K1AA1 204 protein 

60 130639 A1557212 Hs,17132 ESTs, Moderately similar to 154374 gene 
130641 AF158555 Hs^39189 glutaminase 
130653 AI861791 Hs.278479 TSPY-like 
130655 AI831982 Hs.17409 cysteine-rich protein 1 (intestinal) 
130666 AL1 17508 Hs.194035 KIAA0737 gene product 

65 130669 AI928985 Hs.17680 hypothetical protein MGC1 314 sinular to 

130693 R68537 Hs.17g62 ESTs 

130694 N^^014^27 HsJ7969 KIAA0663 gene product 
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131947 AI123939 Hs.182997 ESTs 0.7 4.1 

1319S1 AA129782 Hs.3576 Homo sapiens mRNA full length Insert cDN 0.9 4.8 

131964 AW381148 Hs.198365 2,3-bisphosphogIycerate mutase 1.1 6.1 

131974 AF208856 Hs.268122 hypothetical protein 1.3 3.9 

5 131983 AF1 19665 Hs.184011 pyrophosphatase (inonganic) 3.3 6.9 

131997 AF229181 Hs. 1 36544 CS box-containing WD protein 0.9 .5.2 

132005 AW162335 Hs.3709 low molecular mass ubiquinone-binding pr 1.2 3.6 

132063 BE277910 Hs.3833 3*-phosphoadenosine S'-phosphosulfate sy 32 1.8 

132065 BE37g335 Hs.211594 proteasome(prosome.macropain)26Ssubu '1.2 3.6 

10 132071 AF217798 Hs.3850 USInnleracting protein NUDEL; endoolig 0.7 5.2 

132079 AI701457 Hs.38694 ESTs . 2 5.3 

132094 NM 016045 Hs.3945 CGM07 protein 1^ 4.3 

132116 AW960474 Hs.40289 ESTs 3.1 3.1 

132164 AI752235 Hs.41270 procoHagen-lysine. 2-oxoglutarate S^lio 1.8 3.7 

15 132181 AW961231 Hs.16773 Homo sapiens clone TCCCIA00427 mRNA sequ 1.2 5 

132208 AL031709 Hs,241575 hypothetical protein CAB561 84 1.4 4.2 

132258 AA306325 Hs.4311 SUMO-1 activating enzyme subunit 2 2 10.3 

132303 BE177330 Hs.325093 Homo sapiens cONA: FLJ21210 fis. done C 1.2 4.1 

132316 U28831 Hs.44566 KIAA1 641 protein 5,9 1.6 

20 132358 NM_003542 Hs.46423 H4 histone family, member G 5.8 1.5 

132384 AA312135 Hs.46967 HSPC034 protein 2.1 9.3 

1 32397 AA021 1 60 Hs.4750 hypothetical protein DKFZp564K0822 1 .3 4.6 

132413 AW361383 Hs.2601 16 metallopiotease 1 (pitrilysin family) 2 4.9 

132442 AW970859 Hs.313503 ESTs 1.2 . 5 

25 - 132534 BE388673 Hs,5086 hypothetical protein MGC1 0433 2 3.9 

132540 BE396290 Hs.5097 synaptogyrin2 1.4 .5.1 

132554 AF065391 Hs.l94718 zinc finger protein 265 1.2 4 

132575 AV660538 Hs.284162 60S ribosomal protein L30 isolog 3 1.7 

132585 AF029750 Hs.179600 TAP binding protein (tapasin) 1.8 4.7 

30 132602 AW606927 Hs.5306 hypothetical protein DKFZp586F11 22 simil 1.6 4.9 

132608 AA353044 Hs.5321 ARP3 (actin-related protein 3, yeast) ho 1.8 8.1 

132718 NM_G04600 Hs.554 Sjogren syndrome antigen A2 (60kD. ribon 4.2 2 

132719 AI264357 Hs.55405 hypothetical protein MGC16212 . 1.1 5.3 
132730 AK000868 Hs.5570 hypothetical protein FU 10006 1.4 5.2 

35 132765 BE222975 Hs.56205 insulin induced gene 1 1.1 5.8 

132782 F07424 Hs.279840 zinc finger protein 222 1.3 3.7 

132793 AB020713 H5.56966 KIAA0906 protein 2.3 6.3 

132805 AW975748 Hs.5724 sclerostin 0.7 7.7 

132863 BE268048 Hs.236494 RAB10. member RAS oncogene famfly 1.8 62 

40 132894 063209 Hs.5944 solute carrier family 11 (proton-coupled 1.5 20.8 

132930 AA579258 Hs.6083 Homo sapiens cDNA: FLJ21 028 fis, done C 1 3.8 

132932 AW1 18826 Hs.6093 Homo sapiens cDNA: FU22783 fis. done K 0.7 5.4 

132933 BE263252 Hs.6101 hypothetical protein MGC3178 1.6 4.1 
132965 AI248173 Hs.191460 hypothetical protein MGC1 2936 1 4,2 

45 132984 BE539199 Hs.62112 zinc finger protein 207 1.5 4.4 

132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 13.9 0.8 

132998 Y00062 Hs. 1701 21 protein tyrosine phosphatase, receptor t 0.6 4.6 

133002 AW499985 Hs.42915 ARP2 (acUn-related protein 2, yeast) ho 1.5 11.1 

133011 NM_006379 Hs.171921 sema domain, immunoglobulin domain (Ig). 3.5 1 

50 133012 AA847843 Hs.62711 Homo sapiens, clone 1MAGE:3351295, mRNA 1 4.5 

133040 AW502761 Hs.30909 KIAA0430 gene produd 0.9 5.5 

133056 H12028 Hs.6396 jumping translocation breakpoint 1.7 5.3 

133063 AI654133 Hs.30212 thyroid receptor interacting protein 15 0.6 4.9 

133067 AK000708 Hs.159764 hypothetical protein FU20701 1.2 3.5 

55 133080 AF089816 Hs.6454 chromosome 19 open reading frame 3 12 17.5 

133110 AA808177 Hs.65228 ESTs 0.9 5.1 

133150 AV655783 Hs.661 Empirically selected from AFFX single pr 1.1 4.5 

133175 AW955632 Hs.66666 ESTs, Weakly sinriilar to SI 9560 proline-r 1.5 4,8 

133199 AF23ig81 Hs.250175 homolog of yeast long chain polyunsatura 5.5 5.9 

60 133203 AA464362 Hs.6748 hypothetical protein PP1 665 1^ 3.7 

133206 AB037773 Hs.6762 hypothetical protein 1.6 8.6 

133221 W32474 Hs.301746 RAP2A. member of RAS oncogene family ZA 4.8 

133229 AL137480 Hs.6B34 KIAA1014 protein 1 4.2 

133241 AW796524 Hs.68644 Homo sapiens microsomal signal peptidase 1.3 3.9 

65 133257 BE617892 Hs.6895 actin related protein 2/3 complex, subun 1.4 5.4 

133271 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon 3.1 0.7 

133273 N27672 Hs.69469 dendritic ceH protein Z5 6.5 
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133287 


AW797437 


Hs.69771 


6-factor, properdin 


1.3 


4 


133291 


BE297855 


Hs.69855 


NRAS^Iated gene 


1.4 


5 


133292 


AA304961 


Hs.699 


peptidylprotyl isomerase B (cydophilin 


2.2 


6.8 


133294 


AJ001388 


Hs.69997 


zinc finger protein 238 


1.5 


4.3 


133300 


AF1 16666 


Hs.70333 


hypothetical protein MGC10753 


1.4 


6.3 


133302 


X04898 


Hs.237658 


apolipoprotein A-il 


0.2 


3.6 


133308 


U56979 


Hs.250651 


H factor 1 (complenient) 


0.6 


5 


133347 


BE257758 


Hs.71475 


add duster protein 33 


1.2 


4.2 


133370 


AF245505 


HS72157 


DKFZP564i1922 protein 


37 


5.8 


133404 


AB007916 


Hs.214646 


K1AA0447 gene product 


1.4 


5.1 


133408 


AI738719 


Hs.198427 


hexokinase 2 


0.9 


6.3 


133422 


AB033061 


Hs,73287 


KIAA1235 protein 


1.2 


3.7 


133442 


AL137663 


Hs.7378 


Homo sapiens mRNA; cDNA DKFZp434G227 (fr 


07 


133448 


M27749 


Hs.288168 


immtinoalobtjlin lamhda>tike oolvnpntidp 1 


1.1 


4.3 


133449 


AF038962 


Hs.7381 


VUlloyC'UCpCllUClH OIIIUll l,<rlalllld O 


0.7 


4.2 


133501 


AI982602 


Hs.74284 




3.1 


5.9 


133504 


NM.004415 


Hs.74316 


desmoplakin (DPI. DPil) 


4!3 


11.5 


133506 


BE562958 


Hs.74348 


hypotheUcal proteb MGC14353 


1.8 


197 


133532 


087452 


HS74579 


KIAAG263 gene product 


1.2 


5.4 


133574 


H97991 


Hs.193313 


Empirically seledied from AFFX single pr 


1.4 


3.9 


133586 


AI929645 


Hs.225936 


synapsin 1 


0.8 


4.9 


133589 


L37368 


Hs.75104 


RNA-binding protein SI, serine^h doma 


2 


10.8 


133591 


AI423369 


Hs.75111 


protease, serine, 11 (IGF tM'nding) 


2.1 


4.5 


133606 


U10564 


Hs.75188 


wee1+ (S. pon*e) homolog 


3.3 


1.1 


133617 


BE244334 


Hs.75249 


ADP-ribosylation ^ctor-like 6 interact) 


2.3 


5.6 


133651 


AI301740 


Hs. 173381 


dthydropyrimidinase-like 2 


0.8 


13.5 


133660 


H14843 


Hs.303154 


popeye protein 3 


1 


9.1 


133663 


AJ006239 


Hs.75438 


quinoid dthydropteridine reductase 


0.5 


5.8 


133668 


L77964 


Hs.271980 


mitogen-activated protein kinase 6 


1.1 


6.9 


133671 


AW503116 


Hs.301819 


zinc finger protein 146 


1.8 


3.8 


133681 


AI352558 


Hs.75544 


tyrosine 3-monooxygenase/lfyptophan 5-mo 


1.5 


11.1 


133694 


W17187.comp Hs.232400 


heterogeneous nudear ribonudeoprotein 


2 


3.9 


133708 


AI018666 . 


Hs.75667 


synaptophysin 


0.6 


3.5 


133737 


AW001130 


HS75824 


KIAA0174 gene product 


1.2 


7.2 


133743 


AI929587 


Hs75847 


CREBBP/EP300 Inhibitory protein 1 


1.5 


5 


133750 


BE410769 


HS75873 


zyxin 


1.2 


4.8 


133765 


M62194 


HS75929 


cadherin 11, type 2. OB-cadherin (osteob 


3.2 


4.1 


133776 


BE268649 


Hs.1 77766 


ADP-ribosyltransfeiase (NAD+; poly (ADP- 


2.1 


3.8 


133799 


W24087 


HS76285 


DKFZP564B167 protein 


1.9 


12.6 


133800 


AF075337 


HS76293 


thymosin, beta 10 


2.6 


6.6 


133802 


AW239400 


HS76297 


G protein-coupled receptor kinase 6 


1 


4.9 


133806 


025969 


HS76325 


step II spCdng factor SLU7 


0.5 


3.8 


133817 


AW578716 


Hs7644 


HI histone family, member 2 


1.5 


4.5 


133829 


AW630088 


HS76550 


Homo sapiens mRNA; cONA OKF2:p564B1264 (f 


37 


133841 


AA345824 


HS76688 


carboxylesterase 1 (monocyte/macrophage 


0.3 


4.4 


133845 


AA147026 


HS76704 


ESTs 


5.5 


2.9 


133863 


A1815523 


HS76930 


synudein. alpha (non A4 component of am 


0.6 


4.8 


133887 


X07767 


HS77271 


protein kinase, cAMP-dependent catalyti 


1 


10.2 


133892 


AW859528 


Hs.301497 


arginyltransferase 1 


0.9 


4.8 


133913 


AU076964 


Hs.7753 


calumenin 


2.8 


10.5 


133914 


AW58213 


Hs.77542 


ESTs 


1.8 


5.6 


133917 


AL031177 


Hs7756 


proteasome (pnosome, macropain) 26S subu 


1.5 


6.6 


133947 


L41066 


Hs.77810 


nudear factor of activated T-ceils, cyt 


1.5 


3.8 


133986 


M54968 


Hs.184050 


v-Kj-ras2 Kirsten rat sarcoma 2 viral on 


0.9 


4.3 


133987 


LI 5409 


Hs.174007 


von Hippel-Llndau syndrome 


2.3 


4.3 


133989 


AL040328 


HS78202 


SWI/SNF related, matrix assodated, acti 


3.3 


3.4 


133990 


R48316 


Hs7822 


Homo sapiens mRNA; cDNA DKFZp564C1216 (f 


1.3 


134029 


BE150882 


Hs,143601 


hypoftetical protein hCLA-iso 


1 


6.5 


134040 


NM.003470 


HS78683 


ubk]uitln spedfic protease 7. (herpes vi 


17 


3.6 


134042 


AI027881 


HS7869 


lysosomal 


1 


7.5 


134049 


AP1 17236 


HS78825 


matrin 3 


1.2 


4 


134095 


NM.004354 


HS79069 


cydin G2 


27 


4.8 


134098 


BE513171 


HS79086 


mitochondrial ribosomal protein L3 


3.3 


2.1 


134207 


Z43039 


Hs.170198 


KIAA0009 gene product 


1.3 


33 


134210 


AF035G06 


Hs.80019 


programmed cell death 6 


17 


6.9 


134218 


U77735 


Hs.80205 


pim-2 oncogene 


0.8 


5.3 


134270 


X68194 


Hs.80919 


synaptophysin-like protein 


14 


11.4 



4.8 



5,6 
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134277 
134280 
134288 
134296 
134300 
134310 
134343 
134364 
134374 
134378 
134382 
134415 
134417 
134421 
134439 
134454 
134494 
134501 
134505 
134520 
134528 
134545 
134553 
134573 
134576 
134577 
134579 
134582 
134600 
134655 
134700 
134737 
. 134762 
134843 
134854 
134865 
134868 
134874 
134885 
134891 
134908 
134934 
134970 
134982 
135011 
135032 
135035 
135051 
135060 
135062 
135077 
135082 
135107 
135143 
t 135156 
135172 
135181 
135222 
135232 
135289 
135290 
135291 
135349 
135357 
135398 
135399 
135400 



NM_004369 
NM„000712 
AI022650 
R00603 
NM_001430 
AL037800 
D50683 
X76534 
N22687 
AL035786 
BE512856 
AI750762 
NMJ006416 
AU077196 
Z23024 
NM_013230 
086981 
W84869 
AW960673 
BE091Q05 
M23161 
AI902899 
A1203545 
NM_016142 
AB033017 - 
BE244323 
AW936928 
AA927177 
AF078859 
AF265208 
AK000606 
D17530 
T51986 
AA428520 
J03464 
AA587775 
AB020689 
AI803761 
AJ00203D 
R51083 
BE089762 
AF005043 
BE560779 
AK002085 
AB037835 
AW301984 
AL034344 
AI272141 
AK001887 
AK000967 
AW503733 
AB017363 
T97257 
AA132813 
BE563088 
AB028956 
BE250865 
AA534a09 
AL038812 
AW372569 
AA331901 
T83882 
AA114212 
AI56S004 
M16029 
W79431 
X78592 



Hs.80988 coIIagen.typeVI,aIpha3 2.6 

Hs.8 1029 biltverdin reductase A 1-8 

Hs.8117 erbb2-lnteracting protein ERBIN . 1.1 

Hs.8128 phosphatidyisernie decarboxylase 1.1 

Hs.8136 endolherial PAS domain protein 1 0.5 

Hs.8148 selenoprotein T 1.7 

Hs.62028 transfomiing growth factor, beta reoepto 0.8 

Hs.82226 glycoprotein (transmembrane) nmb 2.2 

Hs.8236 ESTs 1.9 

Hs.82425 actin related protein 2/3 complex, subun 1.5 

Hs.109051 SH3 domain binding glutamic acid-rich pr 1,1 

Hs.8291 1 protein tyrosine phosphatase type 1 VA, m 1 .9 

Hs.82921 solute carrier family 35 (CMP-sialic aci 1 .2 

Hs.82985 collagen, type V. alpha 2 6.6 

Hs.138860 Rho GTPase activating protein 1 2 

Hs.286124 CD24 antigen (small cell hing carcinoma 3.5 

Hs.84084 amyloid beta precursor protein (cytoplas 1.5 

Hs.211568 eukaryotic translation initiation factor 1.2 

Hs. 1 77530 ATP synthase, H+ transporting , mitochond 1 .3 

Hs.74861 activated RNA polymerase II transcriptio 1.8 

Hs.84775 Human transposon-liKe element mRNA 0.8 

Hs.85155 butyrate response factor 1 (EGF-response 1 .4 

Hs.296169 S-phase response (cyclin-related) 0.8 

Hs.279617 steroid dehydrogenase homolog 1-3 

Hs.8594 KIAAl 191 protein 0.9 

Hs.85951 exportin, tRNA (nuclear export receptor 4 

Hs.85963 DKFZP564M1 82 protein 2.2 

Hs.86041 CGG triplet repeat binding protein 1 1 .6 

Hs.86347 hypothetical protein 2.1 

Hs.1 23090 SWl/SNF related, matrix associated, acti 1 .7 

Hs.8868 golgi SNAP receptor complex member 1 4.4 

Hs.89434 drebrinl 3.1 

Hs.283108 hemoglobin, gamma G . 0.5 

Hs.90061 progesterone binding protein 1.3 

Hs.179573 collagen, type I. alpha 2 8.7 

Hs.66295 multi-PDZ-domain-containing protein 1.7 

KIAA0882 protein 3.4 

serine palmitoyltransferase, long chain 1 .3 

progesterone membrane binding protein 1 .4 

ESTs 1 

hypothetical protein 1-9 

poly (ADP-ribose) glycohydrolase 1 

NICE-5 protein 14 



Hs.90419 
Hs.90458 
Hs.9071 
Hs.90787 
Hs.9877 
Hs.91390 
Hs.284233 
Hs.92308 
Hs.92991 



Homo sapiens cDNAFU 11 223 fis, clone PL 1.6 

KIAA1414 protein 1.2 

Hs.173685 hypothetical protein FU12619 1.7 

Hs.284186 forWieadboxCI 3.2 

Hs.83484 SRY (sex detennining region Y)-box 4 4.2 

Hs.259842 protein kinase, AMP-activated. gamma 2 n 1 .3 

Hs.93872 KIAA1682 protein 2 

Hs.9414 K1AA1488 protein 2.8 

Hs,94234 frizzled (Drosophila) homolog 1 2.4 

Hs:337531 ESTs. Moderately similar to 138022 hypot 1.4 

Hs.69559 KIAA1096 protein t8 

Hs.9552 binder of Ari Two 1^ 

Hs.12144 KIAA1033 protein 3.1 

Hs.279529 px19-like protein 1-3 

Hs.183487 interferon stimulated gene (20kD) 13 
Hs.96800 ESTs. Moderately simiter.to ALU7_HUMAN A 2.1 

Hs.9788 hypothetical protein MGC10924simnar to 0.9 

Hs.184736 hypothetical protdnFU1G097 1 

Hs.97927 ESTs 1-2 

Hs.9930 serine (or cysteine) proteinase inhibito 2.6 

Hs.79572 cathepsinD (lysosomal aspartyl protease 2.5 

Hs.287270 ret proto-oncogene (multiple endocrine n 0.4 

Hs.326249 ribosomal protein L22 1.5 

Hs.99915 androgen receptor (dihydrotestosteroner Z2 
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3.5 
5.8 
3.6 
5.9 
4.8 
7.9 
7.6 
3.6 
3.6 
8.3 
3.6 
4.6 
7.5 
8.7 
3.9 
1.1 
4.4 
5.7 
3.9 
4.3 
5,6 
5 

3.9 

5,7 

3.7 

6.8 

4.3 

3.6 

3.5 

4.2 

0.9 

1.6 

4.6 

3.7 

17.3 

4 

0.9 

6.9 

9.6 

10.1 

3.9 

4.3 

10.4 

4.1 

5.6 

7.6 

0.6 

'4.1 

4.8 

3,7 

3.7 

4.8 

5.8. 

8.5 

6.8 

1.4 

7.5 

3.8 

3.9 

8.4 

3.8 

3.5 

8.9 

5.4 

7.9 

4.5 

1.8 
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302665 
302892 
302963 
303131 
303150 
310125 
312662 
319429 
320591 
408779 
410691 
410763 
415738 
420185 
422055 
425815 
426218 
427397 
427466 
427505 
427723 
428673 
430219 
430450 
432866 
433423 
437562 
437667 
437754 
440252 
441471 
448292 
449404 
449964 
451389 
452685 



R99693 
AW176909 
AW673106 
AW081061 
AA887146 
AA147979 
AA233808 
BE616412 
AA054761 
AA412048 
AW239226 
AF279145 
BE539367 
NM_015925 
NM_014320 
R94023 
AF1 19043 
AI929685 
AA523543 
AA361562 
AI355260 
AW601325 
X99209 
R23553 
BE395875 
BE407127 
AB001636 
BE616412 
R60366 
BE513940 
AL042986 
BE281316 
H51066 
AW001741 
N73222 
A1634651 
RC.H15847.S 
RC.W84712 
X14008_ma1_f 
RC_H86543_f 
H07011 

RCJiA164586^s 
RC_AA070485 
RC_H98714.s 
RCJ\A406145_f 
AA458584 
AA031548 
X02761 

RCJ^487193 

R25326 

RCJ^393805 

RC.AA449333 

RC_AA287681j5 

RC^AA490B54 

RC C14243_f 

R21443 

RC AA251902 

M21121_s 

C00038_s 

Y00503 

RC_R27006J 

RC_AA416886 

RC_AA450450 

RCJ\A488433 

RC.AA278400J 

U28831 

RC.AA199588 



Hs.224410 Homo salens cONA FU12843ns, done NT 3.6 

Hs.42346 caldneunn-binding protein calsardn-l 3.3 

Hs.151945 mitochondrial ribosomat protein L43 0.9 

Hs.103180 DC2pro^ 3 

Hs.8217 stromaJ antigen 2 6.2 

Hs.285005 mitochondrial import receptor Tom22 1.2 

Hs.286241 protein kinase. cAMP-dependent. regulato 1 

Hs.286218 junctional adhesion molecule 1 1.5 

Hs.169149 kaiyopherin alpha 1 (importin alpha 5) 1.2 

Hs.279574 CGI-39 protein; oetl death-regulatory pr 1.3 

Hs.65450 reticulon4 1.2 

Hs.8966 hypothetical protein FU21776 2 

Hs.295953 ESTs, Weakly similar to AF220049 1 uncha 1.3 

Hs.95697 Bver-specific bHLH-Zip transcription fa 1.5 

Hs.1 11029 putative heme-binding protein 2 

Hs.337531 ESTs, Moderately sin^lar to 138022 hypot 1 .7 

Hs.168005 Homo sapiens cDNAFLJ13372fis. clone PL 3.3 

Hs.1 77656 calmodulin 1 (phosphorylase kinase, delt 1.3 

Hs7678 cellular retinoic add-binding protein 1 1.1 

Hs.1 78761 26S proteasome-assodated pad1 hdmotog 3.2 

Hs.279789 histone deacetylase 3 2.8 
Hs.324278 Homo sapiens mRNA; cDNA DKFZp566M063 (fr 

Hs.235d87 Hivrn (hnRNP methyltransferase. S. cerevi 1.8 
Hs.241489 hypothetical protein 
Hs.279609 mitochondrial carrier homolog 2 
Hs.8997 heat shock 70kD protein 1 A 
Hs.5683 OEADyH (Asp-Glu-Ala-Asp/His) box potypep 
Hs.286218 iundional adhesion molecule 1 
Hs.5822 Homo sapiens cONA: FU22120 fis. clone H 
Hs.6101 hypothetical protein MGC3178 
Hs.7857 erythrocyte membrane protein band 
Hs.47334 hypothetical protein FLJ14495 
Hs.23581 leptin receptor gene-related protein 
Hs.273193 hypothetical protein FLJ10706 
Hs.279009 matrix Gla protein 
Hs.30250 v>maf musculoaponeurotic fibrosarcoma (a 
peptidylpnolyl Isomerase 8 ((^philin B) 
calumenin 

fysozyme (renal amyloMosis) 
ESTs 

ESTs; Weakly similar to SAS [H.sapfens] 



1.1 
1.5 
1.3 
1.6 
1.3 
2 

1.1 
0.5 

2.5 
1.1 
1.4 
4 

0.8 
1.8 
3,5 
0.9 
1.8 
1.8 
ESTs 
3.4 
1.6 
ESTs 
3.4 



Homo sapiens clone 23967 
ESTs 

SRY (sex detennining region Y)-box 4 
cell division cyde 42 (GTP-binding protein; 25kD) 

fibronectini 3.6 

secreted frizzled-related protein 4 4.7 

Homo sapiens mRNA for putative vacuolar 0.9 

ESTs; Weakly similar to (defiine not 1.1 

ESTs 2.9 

ESTs 

ESTs;H^hlystmilartoheatshockfactor 1.4 

ESTs; Highly simitar to heat shock factor 1 .7 

ESTs 1.6 

Homo sapiens lysophospholipase (LPL1 ) 2.2 

small indudble cytokine A5 (RANTES) 0.9 

ESTs 2.8 

keratin 19 3.1 

ESTs 1.6 

ESTs; Weakly similar to predlded using 3.1 

fibroblast growth factor receptor 2 (bacteria- 1 .5 

ESTs; Weakly similar to deduced amino add 1 . 1 



3.6 

1;6 

4.2 

17.3 

4 

6.6 

3.5 

4.7 

5.6 

3.5 

13.9 

5.1 

3.9 

6.2 

11.3 

3.6 

2.8 

4.7 

3.7 

2:5 

22 

1,1 

8.8 

5.6 

6.1 

7.6 

6.5 

3.5 

5.7 

6.2 

3.7 

4.9 

3.6 

3.5 

112 

5.6 

4.8 

4.6 

4.5 

6.6 

3.9 

6.2 

2.6 

3.5 

4.6 

0.4 

3.1 

15.2 

4 

5 

8.4 
4.6 
1.3 
5 

5 

3,7 
3,8 
9.9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 



5.2 



0.8 



3.9 



Homo sapiens HRIHFB21 15 mRNA; partial cds 
Human protein immuno-reactive with anti-PTH4.4 0,6 
Homo sapiens actin-related pmtein Afp3 (ARP3) 1 .8 4.7 
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1.5 



3.6 
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AF006082 


Homo sapiens actin-related protein Arp2 (ARP2) 


1.6 


10.9 


RC H90899 


desmoplakin (DPI; DPIl) 5,4 


5.5 




RC.W95070 


desmoplakin (DPI; DPlI) 5 


2.6 


3.9 


RC_T90946J 


Human mRNA for KIAA263 gene; complete cds 


1.1 


D87258 


protease; serine; 11 (IGF binding) 2.4 


3.5 


5.3 


AA313414_s 


ESTs; Weakly similar to cDNA EST EMBLTl 157 


1.5 


RC H73484_s 


ESTs; WeakV similar to similar to Yeast 1 .3 


6.3 




AFFX-HUMISGF3A/M97935.3 


2.3 


13^ 


AFFX-HUMRGErt^10098^5 1.1 


7.9 


7.4 


AFFX-M27830^5 




0.5 


AFFX-M27830_5 




0.6 


5.4 


RC AA0S3431J 


ESTs 


0.8 


4.1 


RC T63769J 


ferritin; light po^peptide 1 .1 


3.7 
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TABLE 8A 

Table 8A shows the accession numbers for those pkeys lacking unigenelD's for Table 8. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for 
sequences comprising each cluster are Usted in the "Accession" column. 

Pkey: Unique Eos probeset identifter number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



108469 116761J AA079487AA1 28547 AA1 28291 AA079587AA079600 
20 125076 190299_1 AA973971 T88817 AA253263 

1 14636 ia9698J AA075488 AA129081 AA074851 AA082852 AA074732 AA084908 AA084751 AA076042 AA131 172 AA085374 AA079519 
AA074510 AA1 13824 AA102437 AA070833 AAO70143 AA084693 AA084389 AA076373 AA075492 AA062834 AA084335 
AA078B29 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 AA070942 AA085296 
123526 genbanleAA608657 AA608657 
25 123533 genbank_AA608751 AA608751 
125090 genbank_T91518 T91518 
125154 genbank.W38419 W38419 
118475 genbanleN66845 N66845 
118505 genbanK.N67343 N67343 
30 101046 entre2_KOl160K01160 

129982 221 J67 214221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 214202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
280847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 247275 X62109 AF062140 L01278 
^ AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 

-3 5 AF062177 AF062222 AF062115 247240 AF062263 AF062261 AF062223 AF06221 1 247238 AW401714 AW404008 AW404991 

AF062280 M99595 247214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 21 1946 247226 AF062205 AF174012 
L01 276 AFG62168 AF062136 X81755 XB1748 AF174019 AF062285 L06924 L11699 AW402665 L09070 L28049 L08090 
AW407843 214171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
. ^ AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 A\268604 Y15773 X64239 X62969 U00506 

40 X73605 X99360 U00577 D83677 AB021539 AF035796 233899 U00588 AJ239353 AF174062 233901 X98899 AF174058 X63080 

D83676 AB021529 218318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 218321 S73957 
M26435AF115130AF115117 292896 U21254 292895 AF115112AF062290 T28938 Y09386AF174087 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF115109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
45 296957 AF021986 292898 295956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF022011 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 296955 296954 233903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 856184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 298717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
50 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 

AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 X04546 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AJ905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA603112 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T981 13 
55 108470 genbank_AA079500 AA079500 
101447 enlre2^M21305 M21305 
124447 genbank.^M8000 N48000 
101624 entrez.M55998 M55998 

131791 221.260 X62111 S67984AJ131056 Z47243Z47235AF062268 247237 AJ131058 AF062130 AF062282 X62108AA385989AA464794 
OO X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U95288 

AF021989 AF062255 AF174081 873953 AF062135 AFG62155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJa08186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R86072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW364072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ012264 AA664201 AI880450 AA327310 AI991250 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749268 AI744441 AA327377 AW512326 AI735170 H01634 
AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 

124842 217726_1 R56485 R37248 R59992 

103758 AA084874_Lal AA084874J 

130064 221 264 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 
X57819 X57823 AW405604 AW404447 Z34914 AW406542 AA427726 AA604389 AW405606 AW405918 AW4051 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ2414G6 
U09902 AW405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194686 AF063774 AF063737 
AFa63755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF19481 1 
AF194809 AF194814 AF194808 AF084505 AF064503 AF063765 AF063757 AF063752 AF063716 AF063736 AF194e05 
AF058077 AFdB3747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF060122 AF194807 AF060135 AF054506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 Z19546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388659AF194625Z74694 AF194588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91 134 U21249 Z46346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599 
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF1 037 10AF1 03706 S62232 AF1 03645 AF103632AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI82G828 H53689 
W26785 AW384495 AW407708 AA541663 AA911602 A1821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193 
AA864302 H25133 D87023 J0301 1 M61771 D87017 AA526865 AA253450 

130232 18831_2 U29463 U06715 W24970 AA584303 T39581 AA155603 AA305043 AA429426 W05664 AA102382 AA482044 W24487 AA319060 

T88946 F10105 AA232161 AA243117 AA158937 AA100864 

109097 genbanK_AA167512 AA167512 
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TABLE 9: Figure 9 from BRCA 001-2 US 

5 Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier nuniber 

1 0 ExAccn : Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



15 Pkey ExAccn UnigenelD UnlgeneTitle 



20 



25 



30 



35 



100690 
102211 
103587 
104115 
105038 
105500 
105990 
106155 
106373 
106414 
110009 
111900 
114540 
116470 
117280 
119771 
121723 
124059 
131148 
132371 
134169 
302235 
452410 



AA383256 

BE314524 

BE270286 

AF18381Q 

AW503733 

AW602166 

At690586 

AA425414 

AW503807 

BE568205 

BE075297 

AF131784 

Aig04232 

AI272141 

M18217 

AI905687 

AA243499 

BE387335 

AW953575 

AA235448 

A1690916 

AL049987 

AL133619 



Hs.1657 

Hs.78776 

Hs.82128 

Hs.26102 

Hs.9414 

Hs.222399 

Hs.29403 

Hs.33267 

Hs.21907 

Hs.28827 

Hs.6614 

Hs.25318 

Hs.75323 

Hs.83484 

Hs.172129 

Hs.2533 

Hs. 104800 

Hs.283713 

Hs.303125 

Hs.46677 

Hs.178137 

Hs.166361 

Hs.29383 



estrogen receptor 1 

putative transmembrane protein 

514 oncofetal trophoblast glycoprotein 

opposite strand to trichortiinophalangeal syndrome I 

KIAA1488 protein 

CEGP1 protein 

hypotheticat protein FLJ22060 

nuclear factor l/B 



mitogen-activated protein kinase kinase kinase 2 

ESTs. Weakly similar to A43932 mucin 2 precursor, intestinal 

Homo sapiens ctone 25194 mRNA sequence 

prohibitin 

SRY (sex detenmining region Y)-box 4 

Homo sapiens cDNA: FLJ21409 fis, done COL03924 

EST 

hypothetical protein FU10134 

ESTs. Weakly similar to S64054 hypothetical protein YGL050w 

p53-induced protein PIGPC1 

PRO2000 protein 

transducer of ERBB2, 1 

Homo sapiens mRNA; cDNA DKFZp564F112 

Homo sapiens mRNA; cONA DKFZp434 
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TABLE 10: Figure 10 from BRCA 001-3 PCX 

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



pkey: Unique Eos probeset identifier number 

1 0 ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD; Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to nonnal body tissue 

R2: Ratio of SO^^ percentile tumor to body 

1 5 R3: Ratio of 75*^ percentile body to tumor 

R4: Ratio of tumor to nomnal breast tissue 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey ExAccn UnigenelD Unigene Titte 



100082 
100103 
100131 
100147 
100154 
100157 
100169 
100203 
100210 
100219 
100234 
100248 
100252 
100260 
100279 
100286 
100294 
100335 
100365 
100375 
100409 
100410 
100414 
100418 
100438 
100439 
100448 
100449 
100522 
100552 
100643 
100661 
100666 
100667 
100745 
100774 
100783 
100821 
100864 
100877 
100892 
101038 
101046 
101079 



AA130080 Hs.4295 
AA380887 Hs.5085 
D12485 
D13666 
H60720 
D14661 



Hs.11951 
Hs.136348 
Hs.81892 
Hs.119 



AL037228 Hs.82043 
.BE242284 Hs.172199 
D26361 Hs.3104 
AW972300 Hs.1 18110 
D29677 Hs.3085 
NM^015156Hs.78398 
NM^006207Hs.1 70040 
D38491 Hs.322478 
D42084 Hs.82007 
BE247550 Hs.86859 
AA331881 Hs.75454 
AW247529 Hs.6793 
AI878927 Hs79284 
D80004 HSJ5909 
086957 Hs.80712 
D86961 HS79299 
m 014735HS.82292 
D86978 Hs.84790 
AA013051 Hs.91417 
M3m26 Hs,122669 
AF234887 Hs.57652 
D87470 HS75400 
X51501 Hs.99949 
AA019521 Hs.301946 
NM_005032Hs.4114 
BE623001 Hs.132748 
L05424 Hs.169610 
L05424 Hs.169610 
BE207168 Hs.144630 
J05581 Hs.89603 
AF07B847 Hs.191356 
M26460 

BE563957 Hs74851 
X80821 Hs.27973 
BE245294 Hs.180789 
BE297139 Hs79411 
KOI 160 

BE264g01 Hs.250502 



proteasome {prosome, macropain) 26S subu 

dolichyl-phosphate mannosyltfcinsferase p 

ectonucleotide pyrophosphatase/phosphodi 

osteoblast specific factor 2 (fasciclin 

K1AA0101 gene product 

Wilms' tumour 1 -associating protein 

D123 gene product 

adenylate cyclase 7 

KIAA0042 gene product 

bone man-ow stromal cell antigen 2 

KfAA0054 gene product Helicase 

KIAA0071 protein 

platelet-derived growth factor receptor- 
KIAA0117 protein 
KIAA0094 protein 

growth factor receptor-bound protein 7 
peroxiredoxin 3 

platelet-activating factor acetylhydrola 

mesodemn specific transcript (mouse) horn 

KIAA0182 protein 

K1AA0202 protein 

lipoma HMGIC fusion pariner-like 2 

KtAA0215 gene product 

KIAA0225 protein. 

topoisomerase (DNA) 11 binding protein 
KIAA0264 protein 

cadherin, EGF lAG seven-pass G-type rece 

K1AA0280 protein 

prolactin-induced protein 

lysosomal 

plastin 3 (T isofomn) 

Homo sapiens ribosomal protein L39 mRNA, 
CD44 antigen (homing function and Indian 
CD44 antigen (homing function and Indian 
nuclear receptor subfamily 2, group F. m 
mucin 1, transmembrane 
general transcription factor IIH, polype 
gb:Homo sapiens (done 104) retinoblasto 
activated RNA polymerase il transcriptio 
K1AA0874 protein 
8164 protein 

replication protein A2 (32kD) 
NM.002122:Homo sapiens major histocompat 
carisonic anhydiase Vlll 
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R1 


R2 


R3 


R4 


4.2 


152 


36 


12.2 


9,8 


123 


13 


5 


13.2 


244 


19 


9.9 


157 


1030 


66 


5 


4.1 


320 


78 


10.6 


4.7 


119 


26 


3 


5.1 


106 


21 


9.2 


47 


47 


1 


4.3 


47 


47 


4 


0.7 


3.8 


350- 


93 


1.9 


4.1 


64 


16 


3 


3.4 


77 


23 


5,9 


4.5 


45 


4 


4 


5.9 


59 


1 


2.6 


3.5 


96 


28 


1.3 


3.1 


306 


98 


1.5 


12^ 


128 


1 


117 


4.2 


187 


44 


5.4 


4.5 


129 


29 


3.1 


3.5 


78 


23 


4.8 


10.2 


102 


1 


4.8 


4 


40 


1 


3.8 


3.2 


32 


2 


2.9 


3.6 


36 


7 


3.2 


5.6 


76 


14 


2 


3.5 


35 


9 


3.1 


5.5 


145 


27 


2.2 


3.4 


34 


1 


1.2 


22.7 


760 


34 


1.4 


14.4 


144 


9 


47 


4.1 


259 


63 


19 


3.3 


116 


36 


2.2 


8.5 


85 


1 


3.2 


3 


594 


201 


2.3 


5 


82 


17 


0.9 


3.6 


37 


11 


2.8 


97 


97 


10 


7.2 


3.3 


33 


1 


0.8 


3.7 


477 


130 


3.1 


6.3. 


63 


4 


57 


4.7 


47 


1 


4.2 


3.8 


115 . 


30 


7.1 


3.9 


390 


100 


11.1 


3.9 


39 


8 


3.6 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 

BNSOOCIO: <WQ1 



101084 


AW409934 Hs75528 


nucleolar GTPase 


4.1 


53 


13 


4 


101104 


AW862258 Hs.169266 


neuropeptide Y receptor Y1 


15,3 


153 


1 


14.1 


101185 


NfwL001621Hs.170087 


aryl hydrocait)on receptor 


11.3 


113 


8 


3.9 


101188 


L20320 Hs.1 84298 


cycfin-dependent kinase 7 (homolo9 of Xe 


3.1 


118 


38 


2 


101201 


L22524 Hs.2256 


matrix metalloprDteinase 7 (MMP7: uterin 


8.2 


396 


48 


0.9 


101232 


AU077288 Hs.242894 


ADP-ribosylatlon faclor-iike 1 


4 


110 


28 


10.7 


101275 


BE545277 Hs.3273 


Ts translation elongation factor, mibch 


4.2 


50 


12 


4.4 


101300 


BE535511 Hs.74137 


transmembrane trafficking protein 


6.6 


135 


21 


13.1 


101396 


BE267931 Hs78996 


proOferating cell nuclear antigen 


6.4 


249 


39 


22.4 


101447 


M21305 


gb:Human alpha satellite and satellite 3 


6.5 


878 


135 


0.8 


101448 


NM_000424Hs.195850 


keratin 5 (epidermolysis bullosa simplex 


• 4.8 


622 


130 


0.7 


101470 


NM_000546Hs.1846 


tumor protein p53 (li-Fraumeni syndrome) 


5,1 


97 


19 


9.3 


101478 


NML002890Hs758 


RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101484 


AA053486 Hs.20315 


interferon-induced protein with tetratri 


11.2 


112 


8 


5.9 


101507 


X16896 Hs.82112 


interleukin 1 receptor, type 1 


3.9 


39 


2 


3.5 


101621 


BE391804 Hs.62661 


guanylate binding protein 1 , interferon- 


3.6 


36 


1 


2.6 


101624 


M55998 


gbiHuman alpha-1 collagen type 1 gene, 3 


3.1 


2898 


923 


2.2 


101654 


AA436989 Hs.1 21 01 7 


H2A histone familv mpmhpr A 


6.9 


103 


15 


R4 


101684 


M63256 Hs.75124 


cerebellar degeneration-related protein 


6.4 


64 


2 


4.9 


101724 


L1 1690 Hs.620 


bullous pemphigoid antigen 1 (230^40kD) 


9.4 


94 


i 


0.3 


101754 


S70114 Hs.239489 


TIA1 cytotoxic nfanule-assncfated RNA-bi 


8.9 


89 


5 


8 


101767 


M81057 Hs 18Q884 








777 




101791 


M83822 Hs.62354 


roll division cvdfi 

wwH WIWI0IWII v^wiw T line 


9 


144 


16 


13 


101794 


M84605 Hs 957 


niitath/P nnioifl npn^ntnr niaiimnipHin iC t 






11 




101803 


AW024390 Hs.1 55691 


nr^JR-c^\l iRiikPinia tranocrintion furinf 

|Jlw^U^^wll IClJIWIIilO UOllOwll^UUII laULwl 


5.4 


180 


34 




101809 


M86849 Hs.323733 


gap junction protein, betel 2i 26kD (conn 


12 


120 


8 


9 


101839 


AA446644 Hs.692 


riA733-9 antifipn* pnlthpfisil nlvmnrntpln 


3.1 


353 


116 


7 8 


101888 


AL049610 Hs 95243 


trfin^rrfntinn plnnnaVrnn farinr A /j^ll\- 


7 1 


7"^ 
# o 


1 




101960 


AL036287 Hs.1 94662 


catoonln 3 acidic 


3.8 


399 


105 


3.3 


102009 


BE245149 Hs.82643 


protein tyrosine kinase 9 


4.6 


148 


32 


11.3 


102095 


U11313 Hs.75760 


sterol carrier protein 2 


9.5 


95 


4 


8.8 


102123 


NM_001809Hs.1594 


centromere orotein A f17lcD^ 


4.2 


42 


7 


3.4 


102125 


NmI006456Hs.288215 


sialyltransferase 


9.3 


93 


4 


3 


102139 


N(\4_^004419Hs.2128 


dual specificity phosphatase 5 


5.4 


137 


26 


2.5 


102162 


AA450274 Hs.1 592 


C0C16 (cell division cycle 16, S. cerevi 


4.6 


151 


33 


2 


102165 


BE313280 Hs.1 59627 


death associated protein 3 


9.3 


93 


5 


8 


102193 


AL036335 Hs.313 


secreted phosphoprotein 1 (osteopontin, 


45.7 


457 


1 


39.7 


102211 


BE314524 Hs78776 


Dtitative transfnem brans nrotein 


3 9 


442 


114 


1.3 


102221 


NM_006769Hs.3844 


LIM domain only 4 


4,9 


49 


1 


3^6 


102242 


1127185 Hs.82547 


retinoic add receptor responder (tazaro 


3.1 


31 


1 


1.3 


102258 


NM_001546Hs.34853 


inhibitor of DNA binding 4, dominant neg 


3.8 


163 


43 


0.5 


102302 


AA3b6342 Hs.69171 


protein kinase O-like 2 


4.5 


45 


1 


3.6 


102304 


AF015224 Hs.46452 


mammaglobin 1 


8.5 


2058 


243 


1^4 


102348 


U37519 Hs.87539 


flldehvdp dphvdfnopna^P familv mpmhpr 


6.4 


428 


67 


2.3 


102369 


U39840 Hs.299867 


heoatocvte nuclear factor ^ ainha 


6.7 


67 


g 


6.3 


102407 


AW602154 Hs.82143 


E744ike fector 2 fefs domain fran(;rrinf 




53 


1 


4.8 


102409 


BE300330 Hs.1 18725 


selenophosphate synthetase 2 


3.3 


111 


34 


7.5 


102457 


NM.001394Hs.2359 


dual specificity phosphatase 4 


20.2 


202 


5 


1.3 


102544 


MM 003937HS 169139 




R 
o.o 


oo 




1.5 


102567 


U63830 Hs 146847 










6.8 


102580 


U60808 Hs.152981 


CDP-dlacvtatvceml synthase f cditmnhafiria 


4 1 


41 




3.3 


102618 


AL037672 Hs.81071 


extracellular matrix orotPin 1 


in 9 


628 • 


62 


17.2 


102638 


U67319 Hs!9216 


caspase 7. apoptosis-related cysteine pr 


5 


66 


13 


5.3 


102663 


NM.002270Hs,1 68075 
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Hs.44856 


hypothetical protein FLJ12116 


5.5 


323 


59 


10.5 


132358 


NM_003542Hs.46423 


H4 histone fainity, number G 


3.3 


979 


298 


2.2 


132384 


AA'^l 91*^*5 

rv\0 1^ iO*J 


He 


HSPC034 protein 


3.6 


36 


1 


3.1 


132388 


W32624 


Hs 278626 


Ang/AbHnteractlng protein Ar9BP2 


5.9 


186 


32 


3.7 


132393 


Al.135094 




hypothetical protein Fl-J 14495 


4.2 


159 


38 


7.1 


132407 


BE613126 


Hs.47783 


B aggressive tymphoma gene 


4.6 


46 


1 


4.3 


132425 


N87549 


Hs.1 25287 


Tjnc finoer orotein ?NF1 40*like nroteln 


3.6 


146 


41 


1,1 


132440 


AB020699 


Hs.1 12751 


KIAA0892 DfDtein 


3.3 


33 


4 


2.9 


132465 


AW169847 


Hs.49169 


KIAA1634 protein 


8.3 


145 


18 


3.7 


132522 


AB023164 


Hs.5070 


K1AA0947 protein 


4.6 


46 


1 


4.4 


132528 


T78736 


Hs.50758 


^ivi\y^ you uiMJtcn liiQu Kdiaiiww \Jt w<fiuiii«/Sv 


9.3 


93 


1 


8.4 


132530 


AA306105 


Hs.50785 




4.9 


49 


-( 


4.4 


132543 


BE568452 


Hs.5101 


nrotein reaulator of cvtoklnesiji 1 


11.8 


201 


17 


19.1 


132572 


AI929659 


Hs.237825 


ctnnal rpmnnitinn narKHp 79kri 


3.8 


38 


1 


3 


132592 


AW803564 


Hs.288850 




4.8 


93 


20 


3.1 


132602 


AW606927 


Hs.5305 




o« 1 


61 


2 




132616 


BE262677 


Hs.283558 




1 A 


193 


58 


19 3 


132617 


AF037335 


Hs.5338 


waiuuiii^ OiiiijUfade All 


1A 9 


390 


28 


99 R 


132618 


AL050025 


Hs.279916 




3.3 


909 


274 


3.2 


132632 


AU076gi6 


Hs.5398 


guanine rnonpnuspnaie synuieiase 


c 


en 


i 
1 


A 1 

H. 1 


132668 


AB018319 


Hs.5460 


ISUWJi f Q piDlBtn 


A 9 


171 

1 r 1 


A1 
•t 1 


19 R 


132742 


AA025480 


Hs.292812 


ESTs Wealdv ^milar to T3346A hvnnfheti 
EOia* vvcaiuj ouiiiiai tu Ii7«jhuo iiy|iuuiBU 


6.5 


65 


1 


5.6 


132790 


AW242243 


Hs.168670 


npmyKnnruil fsimPQvlafari nmtaln 
pciUAiamiKii latiicsyiciicu (Jiuioifi 


3 7 


37 


1 


9 9 


132811. U25435 


Hs.57419 


• 1 v/*uiiiuiiig laclur iinyci piuici 


7 


IIS 

1 13 


17 


S A 


132852 


AL120050 


Hs.58220 


Unmn canipnc W^MA* PI l9^nnR fic Hnna 1 




R1 

O 1 


1Q 


(% 1 


132856 


NM^001448Hs.58367 


yiypKxiii *♦ 


A fl 


46 




3 R 


132880 


BE077155 


Hs.1 77537 


hvDothetical nratPin nKF7n761R15l4 


12.6 


126 


g 


9.9 


132902 


AI936442 


Hs.59838 


hvDofhefical Drotetn FLJ 10808 


11 


187 


17 


10.4 


132906 


BE613337 


Hs.234896 


ripminin 


3 3 


106 


33 


2.6 


132914 


AL047045 


Hs.60293 


Hnmo sanipn<i dtinp 1 99482 unknown mRN A • 


3 5 


110 


32 


2.1 


132968 


AF234532 


Hs.61638 




4 1 
t, 1 


62 


15 


4.9 


132977 


AA093322 


Hs.301404 




99 1 


991 


q 


17.8 


132990 


X77343 


Hs.334334 


franscrintion factor AP-2 aloha factivat 


12.7 


311 


25 


2.4 


132994 


AA1 12748 


Hs.279905 




3 


380 


127 


5.5 


133011 


NM_006379Hs.171921 




7 


271 


37 


2.3 


133015 


AJ002744 


Hs.246315 


I i nP>N^r!Ptvl>alnh a _0_n alsrtn^si mi np*nnlvn 


4 5 


427 


93 


10.4 


133070 


U92649 


Hs,64311 


3 HLcsinfpnrin ^^nfi mpffltlonmfpinACP rinma 
a UMiim^iiM fliiu 1 1 ictaimpiuiBiiiodc umiio 




36 




3.1 


133091 


AK001628 


Hs.64691 


KIAAfUA!) nmtain 


9 


117 


93 


5 


133192 


AA218564 


Hs.67052 


vauuuiai piutciii ouiuiiy £aj ^yt^oi iiuiiiui 


1 

O, i 




118 


2.5 


133197 


A(275243 


Hs. 180201 


hvnofhefrcal onotefn FLJ20671 


5.1 


58 


12 


5.7 


133199 


AF231981 


Hs,250175 


hnnininn nf vpa^f Innn chain nnlviin<:flfiira 

IIVMIIUtUU Ul TwOOl lUIIM wllOMI ^UIjUl lOulUICS 


3 


816 


275 


3.9 


133221 


W32474 


Hs.301746 


RAP9A mpmhpr of RA}% nnt^nntino familw 
rvAf iiiciiiuci ui rvi w ui i\^y CI ic laituiy 


O. 1 


234 


76 


8.6 


133240 


AK001489 


Hs^42894 


ADP-rihofivlation factor-like 1 


8.1 


81 


1 


4.6 


133271 


Z48633 


Hs.283742 


M caf^pne mRMA fnr rpimtranenncnn 
n.9a|ACiia iiir\iv^ lui leuvuaiispuouii 


19 A 


19A 


A 

u 


10.8 


133291 


BE297855 


Hs.69855 






<JO 




2.9 


133294 


AJ001388 


Hs.69997 


Tine finnpr nmfpin 9^A 




9^A 


3D 


18.9 


133350 


A1499220 


Hs.71573 


hwnnHipfi/^l nmfpin PI Mn07A 


A R 


AR 


c 


3.5 


133362 


AK001519 


Hs.7194 






iin 

1 IV/ 


99 


9.7 


133370 


AF245505 


Hs.72157 


DKFZP564I1922 protein 


3.2 


725 


227 


3.2 


133407 


AF017987 


Hs.7306 


secreted frizzled-related protein 1 


4.1 


374 


91 


1.1 


133422 


AB033061 


Hs73287 


KIAA1235 protein 


4.3 


43 


1 


3.9 


133435 


AI929357 


Hs.323966 


Homo sapiens done H63 unknown mRNA 


5.5 


186 


34 


16.5 


133479 


W01556 


Hs.238797 


ESTs, Moderately similar to 138022 hypot 


3,5 


35 


7 


2.1 


133493 


AW998046 Hs. 194369 


arginine-glutamic add dipeptide (RE) re 


3.6 


39 


11 


0,4 


133504 


NM_004415Hs.74316 


desmoplakin ppi, DPIl) 


4.1 


640 


158 


3 


133517 


NM^000165Hs.74471 


gap junction protein, alpha 1, 43kD (con 


3.2 


351 


111 


5.2 


133536 


W25797 


Hs.1 77486 


amyloid beta (A4) precursor protein (pro 


3.2 


226 


71 


2.8 


133578 


AU077050 


HSJ5066 


translin 


3.4 


178 


53 


8.8 


133633 


D21262 


Hs.75337 


nudeolarand coHed-body phosphproteln 


4.7 


47 


1 


4 


133640 


AW24642B Hs.75355 


ubiquitin-conjugating enzyme E2N (homolo 


8.5 


85 


1 


7.2 


133669 


NM_006925Hs.166975 


spndng factor, arginine/serine-rich 5 


3.6 


36 


1 


0.4 


133681 


AI352558 


Hs.75544 


tyrosine 3-monooxygenase/tryptophan 5-mo 


3.4 


234 


68 


10.7 


133746 


AW410035 HSJ5862 


MAO (mothers against decapentaplegic. Or 


9.3 


. 93 


1 


7.8 
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55.. 



60 



65 



133765 M62194 Hs.75929 


cadherin 11. type 2, OB-cadherin (osteob 


3.2 


ObU 


1/4 




133780 AA557680 Hs.76152 


decorin 


5.4 


144 


27 


13.3 


133784 BE622743 Hs.301064 


arfaptln 1 


47 


47 


1 


4.1 


133814 NM^002462Hs.76391 


myxovinjs (influenza) resistance 1, homo 


3.3 


380 


114 


4.9 


133829 AW630088 Hs.76550 


Homo sapiens mRNA; cDNA DKF2p564B1264 (f 67 


304 


46 


7.8 


133845 AA147026 Hs.76704 


ESTs 


6.2 


600 


97 


4.1 


133913 AU076964 Hs.7753 


calumenin 


3.3 


889 


267 


5 


133968 AA355986 Hs.232068 


transcription factor 8 (represses interi 


3.7 


91 


25 


2.6. 


133990 R48316 Hs.7822 


Homo sapiens mRNA; cDNA DKFZp564C1216 (f 3.4 


91 


27 


8.5 


133999 AA535244 Hs.78305 


RAB2. member RAS oncogene family 


7.8 


78 


. 1 


5.6 


134032 NM.005025HS.78589 


serine (or cysteine) proteinase inhibito 


5.9 


59 


1 


3.3 


134064 AR)91622 Hs.78893 


KIAA0244 protein 


5.8 


58 


1 


4.9 


134087 U51166 Hs.173824 


thymine-DNA glycosylase 


6.4 


100 


16 


4.4 


134089 R51273 Hs.79029 


ESTs 


5.1 


51 


9 


3.8 


.134095 NWL004354HS.79069 


cydin G2 


5 


50 


1 


3.2 


134098 BE513171 Hs.79086 


mttochondriat ribosomal protein L3 


4.8 


246 


51 


3.9 


134110 U41060 HS79136 


LIV-1 protein, estrogen regulated 


4.5 


1472 


330 


2.1 


134125 NM_014781Hs.50421 


K1AA0203 gene product 


4.6 


69 


15 


5.8 


134246 D28459 Hs.80612 


ubiquitin-conjugating enzyme E2A (RAD6 h 


7 


97 


14 


7.5 


134257 C05768 Hs.8078 


Homo sapiens done FBD3 Cri-du-chat crit 


3.4 


34 


5 


2.6 


134272 X76040 Hs.278614 


protease, serine. 15 


3.6 


36 


1 


2.8 


134282 R45621 Hs.81057 


iiypothetical protein MGC2718 


67 


67 


9 


5.7 


134288 AI022650 Hs.8117 


erbb2-interacting protein ERBIN 


4.5 


137 


31 


12 


134321 BE538082 Hs.8172 


ESTs, Moderately similar to A46010 X-lin 


5.2 


52 


1 


4.9 


134326 AW903838 Hs.81800 


chondroitin sulfate proteoglycan 2 (vers 


8.6 


568 


66 


22.4 


134328 AW959281 Hs.8184 


ESTs 


4.8 


53 


11 


3.7 


134348 AW291946 Hs.82065 


interieukin 6 signal transducer (gp130. 


7.1 


71 


4 


6.4 


134359 NM O0l982Hs. 199067 


v-€rl>-b2 avian erythroblastic leukemia v 


3 


68 


23 


2.8 


134367 AA339449 Hs.82285 


phosphoribosylglycinamidefomiyltransfer 


4.4 


44 


1 


4.1 


134374 N22687 Hs.8236 


ESTs 


13.3 


445 


34 


6 


134380 AU077143 Hs.179565 


minichromosome maintenance deficient (S. 


4.5 


45 


2 


3.4 


134395 AA456539 Hs.8262 


lysosomal 


6 


60 


5 


5.9 


134401 AI916662 Hs.211577 


kinectin 1 (kinesin receptor) 


4.1 


301 


73 


6.1 


134405 AW067903 Hs.82772 


collagen, type XI, alptia 1 


4.6 


1216 


267 


4.4 


134415 A(750762 Hs.82911 


protein tyrosine ptiosphatase type IVA, m 


4.9 


163 


34 


15.1 


134417 NM-005416Hs,82921 


sohjte carrier family 35 (CMP-sialic aci 


4.9 


49 


3 


3.6 


134419 W95642 Hs.82961 


trefoil factor 3 (tntestinal) 


3.2 


1872 


592 


3.3 , 


134421 AU077196 Hs.82985 


.collagen, type V. alpha 2 


6.3 


1075 


171 


3.8 


134436 U29344 Hs.83190 


fatty acid synthase 


3.3 


710 


217 


2 


134485 X82153 Hs.83942 


cathepsin K (pycnodysostosis) 


34.3 


411 


12 • 


5,1 


134487 AF061739 Hs.83954 


protein associated with PRK1 


4.8 


153 


32 


4.3 


134495 D63477 Hs.84087 


KIAA0143 protein 


3.1 


147 


48 


12.7 


134520 BE091005 Hs74861 


activated RNA potymerase II transcriptio 


3.3 


33 


1 


2 


134542 M14156 Hs.85112 


insulin-like growth factor 1 (somatomedi 


4.2 


42 


5 


2.6 


134570 U66615 Hs.172280 


SWI/SNF related, matrix associated, acli 


3.9 


39 


1 


2.5 


134590 AW903849 Hs.173840 


HUEL(C4orf1)-interactlng protein 


37 


41 


11 


0.6 


134604 NrvL002884Hs.865 


RAP1A, member of RAS oncogene family . 


5.2 


52 


1 


3 


134612 AW068223 Hs.l71581 


ubiquitin C-tenninal hydrolase UCH37 


4.9 


49 


1 


1 T 


134643 AW299723 Hs.87223 


bone morphogenetic protein receptor, typ 


5.2 


52 


5 


3.5. 


134654 AK001741 Hs.8739 


hypothetical protein FU10879 


6.4 


64 


1 


C 4 

5- 


134656 A1750878 Hs.87409 


thrombospondin 1 


12.6 


126 


1 


10.8 


134672 AF271212 Hs.322901 


disnipter of silencing 10 


5.4 


81 


15 


2.6 


134700 AK000606 Hs.8868 


golgi SNAP receptor complex member 1 


3.4 


17Q 

lis 


Oc. 




134711 X04011 Hs,88974 


cytochrome b-245, beta polypeptide (chro 


3.2 


143 


45 


13.9 


134722 AF129536 Hs.284226 


F-box onty protein 6 


7 


70 


6 


6 


134856 BE281128 Hs.9030 


TONDU 


3.1 


31 


1 


2.3 


134880 A1879195 Hs.90606 


15 kDa selenoprotein 


5.7 


57 


1 


5 


134917 X87241 Hs.166994 


FAT tumor suppressor (Drosophiia) homolo 


3.2 


153 


48 


4.7 


134921 AL137491 Hs.125511 


Homo sapiens mRNA: cDNA DKF2p434P1530 (f 4 


452 


114 


2 


134982 AK002085 Hs.92308 


Homo sapiens cDNA FU1 1223 fis, done PL . 


5.1 


150 


30 


7.2 


134989 AW968058 Hs.92381 


nudix (nucleoside diphosphate linked moi 


8.2 


114 


14 


9.9 


135029 H58818 Hs.187579 


hydroxysteroid (17-beta) dehydrogenase 7 


11.5 


115 


1 


10 


135035 AL034344 Hs.284186 


forkheadboxCI 


5.4 


259 


48 


1.4 


135051 AI272141 Hs.a3484 


SRY (sex detemiining region Y)-box 4 


3.3 


1296 


394 


.2.2 


135062 AK000967 Hs.g3872 


KIAA1682 protein 


3.8 


240 


64 


3.2 


135073 W55956 Hs.94030 


Homo sapiens mRNA; cDNA DKFZp585E1624 (f 8.1 


101 


13 


7.9 


135098 AW274526 Hs.277721 


ovarian carcinoma antigen CA125 


3.3 


33 


1 


2.6 
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135117 


W52493 


Hs.94694 


Homo sapiens cDNA FUlOSBI fis. done NT 


5.3 


53 


1 


4.1 




'135144 NM_016255Hs.95260 


Autoson^l Highly Conserved Protein 


7.4 


74 


5 


2.4 




135154 


AK001835 


Hs.267812 


sorting nexin 4 


6.6 


69 


11 


6.3 




135155 


AI207958 


Hs.166556 


Homo sapiens, Similar to TEA domain fami 


6.1 


61 


1 


5.1 




135172 


AB028955 


Hs.12144 


KIAA1033 protein 


3.4 


88 


26 


1.4 




135242 


AI583187 


Hs.9700 


cycfin El 


3.1 


31 


1 


2.3 




135243 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


3.4 


169 


50 


9.1 . 




135269 


NM_003403Hs.97498 


YY1 transcription factor 


3.4 


475 


142 


2.5 




135356 


BE312948 


Hs,18104 


liypolhetical protein FU 11274 


3.1 


31 


10 


1,7 




135357 


AI565004 


HS79572 


cathepsin D (lysosomal aspartyl protease 


4.7 


710 


151 


2.5 




135389 


U05237 


Hs.99872 


fetal Alzheimer antigen . 


20.6 


206 


4 


. 19.1 




135397 


L14922 


Hs.166563 


repilcation factor C (acSvator 1) 1 (14 


3.2 


32 


1 


2.4 




-135400 


X78592 


Hs.99915 


androgen receptor (dihydrotestosterone r 


3.2 


117 


37 


9.4 






AI471525 


Hs.247486 


ESTs 


3.8 


58 


16 


5,5 






X70683 


Hs.93668 


ESTs 


1.8 


1047 


596 


1,6 






L14922 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


5 


285 


58 


1.2 






M23263 


Hs.g04 


amyio-1 ;6-gtuoosida5e: 4-alpha-glucanotransferas 


3.1 


31 


1 


2.6 




AI267886 


Hs.148027 


polymerase (RNA) It (DNA directed) polypeptide B 


7.8 


137 


18 


11.9 




AA044640 


Hs.241676 


stromal cell-derived factor 1 


4.7 


114 


25 


0.9 






Ng0980 


Hs.227459 


ESTs: Moderately similar to 1111 ALU SUBFAMILY 


4.7 


151 


32 


9.3 




AA8732B5 


Hs.137947 


ESTs 


4.7 


47 


3 


4.4 






T56679 


Hs.865 


RAP1 A; member of RAS oncogene family 


4 


40 


1 


3.4 






AA305536 




'EST176S22 Colon carcinoma (Caco-2) cell line II 


3.6 


121 


34 


11.8 




AI369384 




arytsuffiatase D 


3.5 


113 


33 


1.7 






AA219081 


Hs.242396 


ESTs; Moderately simitar to Ul! ALU SUBFAMILY] 


3.4 


107 


32 


9.9 
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TABLE lOA 



Table 10 A shows the accession numbers for those pkeys lacking unigenelD's for Table 10. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. TTiese sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland CaUfomia). The Genbank accession nxmibers 
for sequences comprising each cluster are Usted in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

123619 371681 1 AA602964 AA609200 

20 - 104602 524482^2 H47610R86920 

121581 283769 1 AA4 1 6568 AA4428a9AA4 17233 AA442223 

123523 genbaniLAA608588 AA608588 

100821 tigr_HT43b6 M26450U09116 

125091 genbank^T91518 T91518 
.25 125150 NOT FOUND,entre2_W38240 W38240 

118475 genbanK_N66845 N66845. 

104787 genbank^AA027317 AA027317 

106055 genbankJ\A417034 AA417034 

113702 genbanK_T97307 T97307 
30 101046 entre3L.K01160 K01160 

101447 entfB?J421305 M21305 

101624 entre^M55998 M55998 

124677 genbank.R01073 R01073 

110581 genbank^H61560 H61560 
35 119023 genbank_N98488 N98488 

110775 genbank_N22414 N22414 

112092 genbanK_R44538 R44538 

112253 genbanK_R51818 R51818 

107014 genbanK_AA598820 AA598820 
40 114988 genbankj^1089 AA251089 
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TABLE 11: Figure 11 from BRCA 001-3 PCX 

Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R 1 : Ratio of tumor to normal body tissue 

R2: Ratio of 90^ percentile tumor to normal body 

R3: Ratio of 76^ percentile normal body to tumor 

R4: Ratio oftumor to normal breast tissue 



Pkey ExAccn UnigenelD 


Unigene Title 


R1 


R2 


R3 


R4 


100131 D12485 Hs.11951 


ectonudeotide pyrophosphatase/phosphodi 


13.2 


244 


19 


9.9 


100147 D13666 Hs.1 36348 


osteoblast spedfic factor 2 (fasctdin 


15.7 


1030 66 


5 


100522 X51501 Hs.99949 


prolactinnnduced protein 


22.7 


760 


34 


1.4 


100666 L03424 Hs. 169610 


C044 antigen (hommg function and Indian 


8.5 


85 


1 


3.2 


101104 •AW862258 Hs.169266 


neuropeptide Y receptor Y1 


15.3 


153 


1 


14.1 


101478 NM_002890Hs.758 


RAS p21 protein activator (GTPase activa 


9.6 


96 


1 


8.5 


101724 L1 1690 Hs.620 


bullous pemphigoid antigen 1 (230/240kD) 


Q A 


94 


1 


U.J 


101754 S70114 Hs.239489 


TIA1 cytotoxic granule-assodaledRNA-bi 


8.9 


89 


5 


8 


101888 AL049610 Hs.95243 


transcription elongation fador A (Sll)- 


7.3 


73 


1 


5.3 


102165 BE313280 Hs.159627 


death assbdated protein 3 


9.3 


93 


5 


8 


102304 AF015224 Hs.46452 


mammaglobin 1 


8.5 


2058 243 


1.4 


102348 U37519 Hs.87539 


aldehyde dehydrogenase 3 family, member 


6.4 


428 


67 


2,3 


102457 NM_.001394Hs.2359 


dual spedfidty phosphatase 4 


20.2 


202 


5 


1,3 


102567 U63830 Hs. 146847 


TRAP family member-assodated NFKB activ 


8.2 


82 


1 


6.8 


102823 D85390 Hs.5057 


carboxypeptidase D 


5.6 


56 


1 


5.3 


103557 AL133415 Hs.297753 


vimentin 


7.5 


136 


18 


3.4 


103613 NM^000346Hs.2316 


SRY (sex determining region Y)-box 9 (ca 


7.3 


73 


1 


5.2 


104115 AF183810 Hs.26102 


opposite strand to trichorhinophalangeal 


29 


290 


1 


26.8 


104667 A1239923 Hs.30098 


ESTs 


14.9 


149 


1 


6.4 


104804 A1858702 Hs.31803 


ESTs. Weakly similar to N-WASP [H^sapien 


7.7 


77 


1 


5.1 


104807 A113g058 Hs.125790 


teudne>rich repeat-containing 2 


7 


70 


1 


6.5 


104896 AW015318 Hs.23165 


ESTs • 


7.4 


74 


1 


6 


104943 AF072873 Hs.114218 


frizzled (Drosophita) homolog 6 


16.2 


162 


1 


4.2 


105038 AW503733 Hs.9414 


KIAA1488 protein 


5.5 


55 


1 


5.2 


105329 AA234561 Hs.22862 


ESTs 


2.8 


131 


47 


3.9 


105500 AW602166 Hs.222399 


CEGP1 protein 


25.4 


508 


20 


3 


105516 AK001269 Hs.30738 


hypothelJcal protein FLJ1D407 


8.3 


83 


3 


1.8 


105730 AW377314 Hs.5364 


DKFZP564I052 protein 


6.9 


69 


1 


4.4 


106012 AI240665 Hs.8895 


ESTs 


21.2 


212 


6 


17.4 


106095 AF115402 Hs.11713 


E74-like fador 5 (ets domain transcript 


26.3 


355 


14 


1 


106155 AA425414 Hs,33287 


nudear fador MB 


9.9 


483 


49 


1.8 


107102 AB037765 Hs.30652 


KIAA1344 protein 


6.3 


63 


1 


5.4 


107136 AV661958 Hs,8207 


GK001 protein - 


2.5 


392 


155 


4.3 


107151 AW378065 Hs.8687 


ESTs 


15.6 


156 


7 


10.8 


107922 BE153855 Hs.61460 


Ig superfamtly receptor LNIR 


9 


90 


1 


5.5 


108339 AW151340 Hs.51615 


ESTs. Weakly similar to ALU7_HUMAN ALU S 


18.7 


187 


1 


17 


109112 AW419196 Hs.257924 


hypothetical protein RJ13782 


4.1 


334 


82 


3.4 


109292 AW975746 Hs.188662 


KIAA1702 protein 


7,1 


71 


1 


6.5 


109415 U80736 Hs.1 10826 


tnnudeotkie repeat containing 9 


12.3 


123 


1 


11.3 


109912 AW390822 Hs.301528. 


L-kynurenine/alpha-aminoadipate anfiinol/a 


14.2 


142 


1 


9.5 


110009 BE075297 Hs.6614 


ESTs, Wealdy similar to A43932 mudn 2 p 


6.3 


693 


110 


7.2 


110915 BE092285 Hs.29724 


hypothetical protein FU13187 


20.9 


209 


1 


193 


111164 N46180 Hs.122489 Homo sapiens cONA FU13289 fis. done OV 


7.7 


77 


1 


5 



210 



<wo 02asasntajj>- 



wo 02/059377 



116470 AI272141 

116786 H25836 

117280 M18217 

117412 N32536 

118472 AL157545 Hs.42179 

119271 AI061118 Hs.65328 

119771 AI905687 



111179 AK000136 Hs.10760 asporin (LRR dass 1) 
111190 AK002055 Hs.151046 hypothetical protein FLJ 11 193 
111223 AA852773 Hs.334838 KIAA1866 protein 
111357 BE314949 Hs.87128 hypothetical protein FU23309 . 
112244 AB029000 Hs.70823 KIAAl 077 protein 
113047 A1571940 Hs.7549 ESTs 

113702 T97307 gb:ye53h05.s1 Scares fetalliver spleen 

114124 W57554 Hs.125019 lymphoid nudear protein (lAF-4) mRNA 
114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 
114768 AF212848 Hs.182339 ets homologous factor 
114880 AL157545 Hs.42179 bromodomain and PHD finger containing. 3 
114965 AI733881 Hs.72472 BMP-RIB 

114988 AA251089 gb:zs04fi)5^1 NCLCXSAP^GCBI Homo sapiens 

115206 AW183695 Hs.186572 ESTs 

1 15719 AW992405 Hs.59622 Homo sapiens, done IMAGE:3507281 , mRlsiA, 
115844 AI373062 Hs.332938 hypothetical protein MGC5370 

Hs.83484 SRY (sex detennining region Y)-box 4 
Hs.301527 ESTs. Moderately similar to unknown (H.s 
Hs.172129 Homo sapiens cDNA: FU21409 fis, clone C 
Hs.42645 solute carrier family 1 6 (monocarboxylic 

bromodomain and PHD firiger containing, 3 
Fanconi anemia, complementation group F 
Hs.2533 EST 
120562 BE244580 Hs.302267 hypothetical protein FU1 0330 
121463 AK000282 Hs,239681 hypotheUcal protein FU20275 
121723 AA243499 Hs.104800 hypothetical protein FLJ10134 
122963 AA478446 Hs.69559 KIAA1096 protein 
123137 A1073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 
123619 AA602954 gb:no97c02.s1 NCLC6AP_Pr2 Homo sapiens 

123709 AA706910 Hs.1 12742 ESTs 
124006 AI147155 - Hs.270016 ESTs 

124059 BE387335 Hs.283713 ESTs. Weakly similar to S64054 hypotheti 
124308 AA249027 Hs.241507 ribosomal protein S6 
125279 AW401809 Hs.4779 KIAA1 150 protein 
125617 AA287921 Hs.164950 ESTs 

Hs. 1 4368 SH3 domain binding glutamk; add-rich pr 
Hs.279009 matrix Gla protein 
Hs.2g6251 programmed cell death 4 
Hs.105700 secreted frizzled-related protein 4 
Hs.21851 Homo sapiens cONA FU12900 fis. done NT 
129017 AA115333 Hs.107968 ESTs 

129229 AF013758 Hs.109643 polyadenylate binding protein-interadin 
129337 NM.014918HS.110488 KIAA0990 protein 
129366 BE220808 Hs.184897 Homo sapiens done 23785 mRNA sequence 
129821 AB028945 Hs.12696 cortadin SH3 domain-binding protein 
130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 
130057 AF027153 Hs.324787 solute earner family 5 (inositol transp 
130095 AKb01635 Hs.14838 hypothetical protein FU10773 
130343 AB040914 Hs.278628 KIAA1481 protein 
130385 AW067800 Hs.155223 stanntocaldn2 
130407 BE385099 Hs.334727 hypothetical protein MGC301 7 

Hs.1 55637 protein kinase, DNA-activaled, catalytic 
Hs.155956 N-acetyltransferase 1 (ary/amine N-acety 
130604 AA383256 Hs. 1657 estrogen receptor 1 

Hs.1 674 glutamine-fniclose-6-phosphate transamin 
Hs.279762 bromodomain-containing 7 
131148 AW953575 Hs.303125 p53-induced protein PIGPC1 
131388 NM 014810HS.92200 KIAA0480 ge^'se produd 
131564 T93500 Hs.28792 Homo sapiens cDNA FU1 1041 fis, done PL 
131742 AA961420 Hs.31433 ESTs 

131877 J04088 Hs.156346 topoisomerase (DNA) II alpha {170kD) 
131985 AA503020 Hs.36563 hypothetical protein FU22418 
132316 U28831 Hs.44566 KtAAl 641 protein 
132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 
132742 AA025480 Hs.292812 ESTs. Weakly similar to T3346Bhypotheti 
132990 X77343 Hs.334334 transcription factor AP-2 alpha (adivat 
133015 AJ002744 Hs.246315 UDP-N-aceJyl^lpha-D^aiactosamine:po)yp 
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127439 D60237 

128305 AI954968 

128482 AI694143 

128790 AF026692 

128925 R67419 



130441 U63630 
130455 D90041 



130617 M90516 
130712 AJ271881 



25.1 


288 


12 


6.7 


6.3 


63 


1 


5.8 


3.6 


402 


112 


4.9 


3.8 


425 


111 


4 


5.7 


567 


100 


6.7 


9.6 


124 


13 


9 


12.3 


129 


11 


11.7 


24.2 


242 


10 


5.6 


6.7 


67 


1 


6.3 


13.7 


137 


1 


8.9 


9.1 


91 


1 


7.6 


35.9 


359 


10 


29.7 


11.5 


115 


1 


6.9 


5.8 


58 


1 


5 


7.6 


144 


19 


13.9 


6.2 


62 


1 


5.4 


1.8 


1047 596 


1.6 


22.8 


228 


9 


12.4 


3.9 


322 


83 


4.4 


17:4 


174 


9 


6.9 


14,5 


145 


1 


2.4 


8.2 


82 


1 


6.4 


3.5 


2073 595 


2.1 


8.5 


127 


15 


1.6 


10.3 


103 


1 


9.3 


2.9 


214 


74 


3.7 


7.2 


72 


1 


5.7 


9.9 


351 


36 


13.9 


8.5 


85 


1 


4.3 


3.9 


60 


16 


4.8 


5.8 


321 


55 


17 


10.4 


880. 


85 


5.3 


10.5 


105 


1 


9.9 


13.1 


131 


1 


5.1 


6.7 


67 


1 


6 


30.6 


306 


4 


26.5 


7.5 


75 


1 


6.5 


7.2 


72 


1 


5.8 


17.4 


409 


24 


7.8 


7.1 


392 


56 


3.6 


8.2 


82 


1 


7.4 


7.1 


71 


1 


6.2 


9.5 


95 


1 


8.5 


7.1 


150 


21 


14:5 


11.4 


114 


1 


10 


6.7 


67 


1 


5.7 


1 


1 


1 


1 


14.6 


219 


15 


7.6 


13.2 


331 


25 


12.4 


72.2 


722 


1 


1.9 


6.5 


65 


4 


5.3 


6.1 


61 


1 


5.7 


10,8 


706 


66 


9.2 


32.2 


322 


1 


4.7 


10 


100 


1 


7.6 


17.5 


175 


2 


12.8 


3.8 


585 


153 


3.7 


7.6 


76 


1 


5 


4.7 


381 


81 


6.4 


11.7 


117 


1 


10.1 


6.8 


68 


1 


5.6 


40.2 


402 


1 


4 


18.6 


186 


10 


1.5 


9.3 


93 


1 


8.4 


6.5 


65 


1 


5.6 


12.7 


311 


25 


2.4 


4.6 


427 


93 


10.4 
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133199 


AF231981 


H5.250175 homolog of yeast long chain polyunsatura 


3 


816 


275 


3.9 


133240 


AK001489 Hs.2428g4 ADP-ribosylation factor-like 1 


8.1 


81 


1 


4.6 


133271 


Z48633 


Hs.283742 H.sapiens mRNA for retrotransposon 


12.4 


124 


6 


10.8 


133640 


AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 


8.5 


85 


1 


7.2 


133746 


AW410035 Hs75862 MAD (mothers against decapentaplegic. Dr 


9.3 


93 


1 


7.8 


133999 


AA535244 


Hs.78305 RAB2, member RAS oncogene fiamily 


7.8 


78 


1 


5.6 


134110 


U41050 


Hs.791 36 LIV-1 protein, estrogen regulated 


4.5 


1472 330 


2.1 


134485 


X82153 


Hs.83942 cathepsin K (pycnodysostosis) 


34.3 


411 


12 


5.1 


134654 


AK001741 


Hs.8739 hypothetical protein FLJ10879 


6.4 


64 


1 


5.1 


134880 


AI879195 


Hs.90606 1 5 kDa selenoprotein 


5.7 


57 


1 


5 


135029 


H58818 


Hs. 187579 hydroxysteroid (17-beta) dehydrogenase 7 


11.5 


115 


1 


10 


135389 


U05237 


Hs.99872 fetal Alzheimer antigen 


20.6 


206 


4 


19.1 


128305 


AI954968 


Hs.279009 matrix Gta protein 


9.4 


94 


3 


5.3 
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TABLE llA 

Table llA shows the accession numbers for those pkeys lacking unigenelD's for Table 1 1 . 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and AUgnment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

123619 371681 1 AA602964 AA609200 
20 113702 genbaniLT97307 T97307 

114988 genbank_AA251089 AA251089 
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TABLE 12: Figure 12 from BRCA 001-3 PCX 



Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number 

1 0 ExAcx:n: Exemplar Accession number. Genbanlc accession number 

UnigeneiD: Unigene number 

Unlgene Title: Unigene gene title 

R1: Ratio of tumor to nomial body tissue 

R2: Ratio of 90^ percentile tumor to body 

1 5 R3: Ratio of 75"» percentile body to tumor 

R4: Ratio of tumor to normal breast tissue 



20 



25 



Pkey 


ExAccn UnigeneiD Unigene Title 


R1 


R2 


R3 


R4 


100131 


D12485 Hs.11951 phosphodiesterase 1 (PC-1) 


13.2 


244 


19 


9.9 


105500 


AW602166 Hs.222399 ESTs 


25.4 


508 


20 


3 


112244 


AB029000 Hs.70823 KIAA1077 protein 


5.7 


567 


100" 


6.7 


114124 


W57554 Hs.125019 ESTs 


24.2 


242 


10 


5.6 


119771 


AI905687 Hs.2533 ESTs 


3.5 


2073 


595 


2.1 


121723 


AA243499 Hs.1 04800 ESTs 


2.9 


214 


74 


3.7 


128790 


AF026692 Hs.1 05700 secreted frizzled-related protein 4 


17.4 


409 


24 


7.8 


131148 


AW953575 Hs.303125 ESTs 


3.8 


585 


153 


'3.7 


131985 


AA503020 Hs.36563 ESTs 


40.2 


402 


1 


4 


133199 


AF231981 H5.250175 Homo sapiens ctone 23904 mRNA sequence 3 


816 


275 


3.9 
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TABLE 13: Table 1 from BRCA 001-5 US 
Table 13 depicts a preferred group of genes upregulated in breast cancer cells. 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title • 

R1 : Ratio of tumor to nomnal body tissue 



Pkey ExAccn UniGenelD Unigene Title 

100038 M97935 control 

100039 M97935 control 

100040 M97935 contiol 

100041 m722S control 

100082 AB003103 Hs.4295 proteasome (prosome; macropain) 26S sub 

100091 AF000177 Hs.1 11783 Uml protein 

100100 AF006084 Hs.11538 actin related protein 2/3 complex; subu nil 

100103 AF007875 Hs.5085 dolichyl-phosphate mannosyltransferase p 

100114 D00596 Hs.82962 tiiymidylate synthetase 

100121 D10495 Hs.155342 protein kinase C; delta 

100123 D10523 Hs.16e669 oxoglutarate dehydrogenase (lipoamide) 

100128 D11094 Hs.61153 proteasome (prosome; macropain) 26S sub 

100131 D12485 Hs.11951 phosphodiesterase l/nucleotide pyrophosp 

100137 D13627 Hs. 15071 chaperonin containing TCP1 ; subunit 8 (t 

100144 D13643 Hs.75616 Human mRNA for KIAA0018 gene; comp 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin Mike 

100154 D14657 Hs.81892 KIAA01 01 gene product 

100164 D14812 Hs.173714 MORF-reiated gene X 

100169 D14878 Hs.82043 D123 gene product 

100190 D21090 Hs.178658 RAD23 (S. cerevisiae) homolog B 

100203 D25S3S Hs.172199 adenylate cyclase 7 

100209 D2630B Hs.76289 biliverdin reductase B (flavin reductase (N 

100215 028598 Hs.82793 proteasome (prosome; macropafn) subunit 

100216 026599 Hs.1390 proteasome (prosome; macropain) subunit 
100219 D28137 Hs.118110 bone marrow stromal cell antigen 2 
100227 028915 Hs.82316 interfenHvinduced; hepatitis C-assodated 
100248 031886 Hs.78398 K1AA0071 protein 

100287 043950 Hs. 1600 chaperonin containing TCP1; subunit 5 (e 

100294 049396 HsJ5454 antioxidant protein 1 

100307 D50525 Hs.699 hypothetical protein 

100335 D63391 Hs.6793 plateiet-acth^ating factor acetylhydroiase; 

100340 D63487 Hs.82563 KIAA01 53 protein 

100355 D78129 Hs.71465 Homo sapiens mRNA for squalene epoxid 

100363 D78514 Hs.78563 ubiquitin-conjugating enzyme E2G 1 (hom 

100368 079987 Hs.153479 extra spindle poles; S. cerevisiae; homolo 

100372 D79997 Hs.184339 KIAA0175 gene product 

100375 D80004 Hs75909 KIAA0182 protein 

100379 D8206O Hs.278721 Ke4 gene; mouse; human homolog of 

100387 D83777 Hs.75137 KIAA0193 gene product 

100393 084145 Hs.39913 novel RGD-containing protein 

100398 D84557 Hs.1 55462 minichromosome maintenance deficient (m 

100405 086425 Hs.82733 nidogen2 

100406 086479 Hs.1 18397 AE-binding protein 1 
100409 086957 Hs.80712 KIAA0202 protein 

100421 086985 Hs.79276 Human mRNA for KIAA0232 gene; comp 

100446 087464 Hs.10037 K1AA0274 gene product 

100447 087465 Hs74583 KIAA0275 gene product 

100448 087469 Hs^852 EGF-like-domain; multiple 2 

215 



16.7 

6.3 

8.3 

14.8 

7.5 

4.9 

4.7 

13.4 

15.9 

4.6 

7.5 

4.4 

8,7 

9.5 

6 

8.5 

10.5 

4.6 

7.9 

5.6 

9.9 

4.9 

14.2 

11.3 

5.7 

5.7 

7,4 

5.6 

12.9 

8.4 

6.8 

4.4 

12.6 

4.6- 

8.5 

8.4 

4.5 

8.1 

10.7 

7.2 

7.2 

5.4 

4.3 

11.9 

9.7 

6.4 

10 

6.2 
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100467 


D89052 


Hs.7476 


ATPase; H-*- transporting; lysosomal (vacu 


7,5 


100466 


D89289 


Hs.1 18722 


fucosyltransferase 8 (alpha (1;6) fuoosyltr 


5 


100486 


HT1112 


Hs.10842 


RzsAJke Protein Tc4 


16.9 


100497 


HT1400 


Hs.79137 


Carfooxvl Methvifransferase Astjartate A 

\JUltJ%J^J% IVIWU IJf lilMl M(w4 Glww| f^w|#BI IdftWf 


5.6 


100618 


HT2710 


Hs,1 14599 


Coiiagsn, Type Viii, Alpha 1 


7.5 


100661 


HT3018 


Hs.132748 


Ribosomai Protein L39 Homolog 


4.4 


100657 


HT3127 


Hs.169610 


Epican» Alt. Spiice 11 


4.6 


100668 


HT3938 


Hs. 169610 


Epican, Alt Splice 12 


4.4 


100676 


HT3742 


Hs.287820 


Fibronectin, AIL Splice 1 


9 


100775 


HT26388 


Hs.89603 


Mudn 1, EpitheRal, Alt. Splice 9 


4.7 


100783 


HT4018 


Hs.191356 


Basic TranscriDiion Factor 44 Kda Subun 

UGlulw 1 ICillwwll|#Uw| 1 9 BwliWl 1 ■ » wUhJUfI 


13.7 


100829 


HT4343 


Hs. 278544 


Cytosofk: Acetc^oetyt-CoGnzyme A Thk> 


10.6 


100830 


HT4344 


Hs.4756 


Rad2 


5.5 


100840 


HT4392 


Hs.1 83418 


Protein Kinase Pitstre, Alpha, AIL Spiice 


4.1 


100850 


HT417 


Hs 297939 

■ lw«^wr www 


CathfiDsin B 


4 


100866 


HT4582 


Hs 75113 

1 • W 1 i W 


Transcription Factor liia 


4.9 


100906 


HT5158 


Hs.5398 


Guanosine 5'-Monophosphate Synthase 


8.7 


100914 


HT511 


Hs.324178 


Ras Inhibitor Inf 


7.2 


100916 


HT544 


Hs 73946 

1 iw« 9 WW~W 


Endothelial Cell Growth Factor 1 


5.9 


100945 


HT884 


no. louvou 




4.6 


100975 


J02923 


Hs 76506 

• Iw* f WW Vw 


Ivmnhocvte cvtosolic oratetn 1 (L-Dlastin 


30.1 


100988 


J035a9 




uh^itifinJikP d 

UUK|uilltl'UnC *r 




10099B 

1 wUwww 


J03909 


Hs 14623 


Intprfpmn' nAmma.inrliirifilp nmtpin 


6.9 


100999 

1 WW WW w 


J03934 


Hs 807(16 


rftanhnrsiQP fMADH/NAnPHl /rvfnrhmm 


4 3 


101011 


J04430 


Hs.1211 


oiwiu ^1 luoul laicoc wiiuotw ipoioioiil 


5.9 


101017 


J04599 


Hs.821 


hish/can 


5.1 


101031 


J05070 


Hs 1 SI 738 


matriy moffillnnrntohiaco Q /nolsifinaco R* 
lllaUIA IllClcUIUpllilCillaoC S |^dalllla9c Oj 




101038 


J05749 


Hq 79411 


rpnliratinn nmtp»in A9 f'^^kDT 


fi 1 
O. 1 


101054 


K02405 


Hq 73Q'?1 

1 lO. > «J{7vf I 


H(iman MHO rlaqq U HLA-DOuheta mRN 


4.3 


101061 


K03515 


Hs 180532 


nltif^qp nhnqnhAfp iqnmpraqp 


4.3 


101091 


L06132 


Mc 14Q155 




7 4 


101097 


L06797 


Hs.89414 


chemokine fC-X-C motift* receotor 4 (ins ■ 


4,6 


101104 


L07615 


Hs.169266 


Human nauroDeDfide Y recentor Y1 /NPY 


18.3 


101143 


L12723 


Hs 900Q3 


hpJ3t Qhrtr^W yniff^ nrofpin A 

Ileal OlIUMV f UniJ ^}^^J^XSU^ t 


174 


101152 


L13800 


HS.98B4 


Homo saDtsns liver exoressed DroteIn oan 


7,6 


101183 


LI 9779 


Hs.795 


H2A histone familyi member O 


10.9 


101216 


L25876 


Hs.84113 


cvclin-denendeni kinase inhibitor 3 fCHK 

\/y will i^jw^wituwi i( ivii lOww iiiiiiwiiwi ^ iw^/i\ 


7.4 


101233 


L29008 


Hs.878 


qnrhitnl riphvrirocipnaqp 


14.6 


101247 


L33801 


Hs.78802 


oh/nofipn qvnthaqp kin^qp ^ hptri 


7.5 


101282 


L38810 


Hs.79387 


nmtpaqnmp fnmqnmp* marmnain^ 9fiR qiih 


4.4 


101326 


L42572 


Hs.78504 


inner membrane protein,* mitochondrial (m 


5,8 


101332 


L47276 


Hs.1 56346 


Homo sapiens (cell line HL-6) alpha topo 


18.9 


101348 


L77213 


Hs.30954 


phosphomevalonate kinase 


7.5 


101352 


L77701 


Hs 16297 

■ iw* f w^wf 


nOX17 /veafit\ homoloa' fivtorhroirip r ox 


9.3 


101378 


Ml 3755 


Hs.833 


intprfprnn-^'imiikalpH nmtpin* I'l kRa 


18.1 


101396 


M15796 


Hs 78996 


nmiifpraHnn rpU niirlpar anfinctfi 


8.6 


101404 


M16342 


Hs.1 82447 


hpf PiTYiPnpni i<s niiHoar rihnniirlpnnmipin (\ 


4,5 


101439 


M20g02 


Hs 268571 




6,1 


101464 


M22538 


Hs.51299 


N/^H dehydnxienase fiibioiiinone) flavo 


8,7 


101469 


M22877 


Hs.169248 


Human somatic cvfochrome c /HCS) oene 


4.2 


101472 


M22960 


Hs 118126 ' 


orotectfve niotain for heta-aalactosidase ( 


6.6 


101478 


M23379 


Hs758 


RAS d21 Drotfiin activator /GTPase arth/a 

1 1 piuidii owuvciiv/i \v 1 rose a^uva 


14 


101484 


M24594 


Hs.20315 


interferon-induoed protein 56 


9.2 


101539 


M30818 


Hs.926 


myxovims (influenza) resistance 2; homo! 


5.1 


101540 


M30938 


Hs.84981 


X-ray repair comptementrng defective rep 


4.7 


101544 


M31169 




Human propionyl-CoA caiboxylase beta-s 


5.5 


101552 


M31642 


Hs.82314 


hypoxanthine phosphoribosyltransferase 1 


8.5 


101580 


M34677 


Hs.83363 


DMA segment on chromosome X (unique) 


4.5 


101600 


M37583 


Hs.1 19192 


H2A histone family; member Z 


5.7 


101663 


M60750 


Hs.2178 


H2B histone famity; memberA 


5.8 


101664 


M60752 


Hs.121017 


H2A histone fami^ member A 


13.5 


101667 


M60858 


Hs.79110 


nudeofin 


4 


101684 


M63256 


Hs.75124 


cerebellar degeneration-related protein (62 


7.6 


101702 


M64929 


Hs.179574 


protein phosphatase 2 (fonrierty 2A); regu 


4.2 


101754 


M77142 


Hs.239489 


TIA1 cytotoxic granule-assodated RNA-b 


4.5 


101758 


M77836 


Hs.79217 


pyrroline-S^rboxylate reductase 1 


5.7 


101767 


M81057 


Hs.180884 


carboxypeptldase B1 (tissue) 


21.7 
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101770 M81601 Hs78869 transcription elongation factor A (Sll); 1 4.6 

101791 M83822 Hs.62354 cell division cyde 4-Iike 9.7 

101803 M86546 Hs.155691 pre-B-cell leukemia transcription factor 1 5.5 

101809 M86849 Hs.323733 Homo sapiens connexin 26 {GJB2) mRNA 22.5 

5 101839 M93036 Hs.692 membrane component; chromosomal 4; su 4 

101851 M94250 Hs.82045 midkine (neurite growth^romoting factor 7.6 

101888 M99701 Hs.95243 transcription elongation factor A (StlHtke 11.4 

101973 S82597 Hs.80120 UDP-N-acetyl-alpha-D-^aladosamineipo 4.6 

101991 U00968 Hs.166 Human SREBP-1 mRNA; complete cds 4.1 

10 102009 U02680 Hs.82643 protein tyrosine kinase 9 4.4 

102025 U03911 Hs.78934 mutS (E. coli) homolog 2 (coton cancen n 4 

102047 U07158 Hs.83734 syntaxin 4A (placental) 6.1 

102051 U0755O Hs.1197 heatshocklOkD protein 1 (chaperonin 10 4.4 

102083 U10323 Hs.75117 interleukin enhancer binding factor 2; 45k 10.4 

15 102095 U11313 Hs.75760 sterol carrier protein 2 9.5 

102130 U15009 Hs.1575 small nudear rit)onucleoprotein D3 polyp 6.6 

102133 U15173 Hs,155596 BCL2/adenovirusE1B 19kD-iiiteradingp 4.3 

102148 U16954 Hs.75B23 ALL1 -fused gene from chromosome 1q 6.9 

102179 U19713 Hs.76364 allograft inflammatory fadorl 4.8 

20 102180 U19718 Hs,83551 microfibrillar-associated protein 2 7:2 

102193 U20758 Hs.313 secreted phosphoprotein 1 (osteopontin; b 7,2 

102198 U21Q9Q Hs.74598 polymerase (DNA directed); delta 2; negu 4.3 

102202 U21931 Hs.574 frudose-bisphosphatase 1 4.5 

102209 U22970 Hs.265827 interferon; alpha-tnducibte pintein (done 9.9 

25 102211 U23070 Hs.78776 putative transmembrane protein 4.9 

102220 U24389 Hs.65436 lysyl oxidase-iike 1 8.5 

102224 U24704 Hs.148495 proteasome (prosome; macropain) 26S sub 5.4 

102234 U26312 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 7.7 

102250 U28014 Hs.74122 caspase 4; apoptosis-related cysteine prot 5.4 

30 102260 U28386 Hs.159557 karyopherin alpha 2 (RAG cohort 1 ; impo 6.3 

102261 U28488 Hs,155935 complement component 3a receptor 1 5.7 

102273 U30888 Hs.75981 ubiquitin specific protease 14 (tRNA-guan 6.1 

102298 U32849 Hs.54483 N-myc (and STAT) interactor 4.1 

102302 U33052 Hs.69171 protein kinase G-like 2 4.3 

35 102305 U33286 Hs.90073 chromosome segr^ation 1 (yeast homolo 5.4 

102320 U34683 Hs.82327 glutathione synthetase 4.1 

102330 U35451 Hs.77254 chromobox homolog 1 (Drosophila HP1 b 4 

102348 U37519 Hs.87539 aldehyde dehydrogenase 8 9.4 

102361 U39400 Hs.75859 chromosome 11 open reading frame 4 5.2 

40 102362 U39412 Hs.75932 N-ethylmaleimide-sensitive fador attachm 9.3 

102369 U39840 tte.299867 hepatocyte nudear factor 3; alpha 7.7 

102395 U41767 Hs.92208 a disintegrin and metalloproteinase domai 10.4 

102409 U4Z2d6 Hs. 118725 sefenophosphate synthetase 2 6.2 

102418 U43923 Hs.79058 suppressorofTy(S.cerevisiae) 4 homolog 4.1 

45 102425 IJ44772 Hs.3873 pafrnitoyl-protein thkjesterase (cerakf-Hpo 4.8 

102457 U48807 Hs.2359 dual spedficity phosphatase 4 6.3 

102465 U49352 Hs.81548 2;4-dienoy I CoA reductase 1; mitochondri 9.4 

102495 U51240 Hs.79356 Lysosomal-associated multispanning mem 6.5 

102534 U56833 Hs.198307 von HippeHindau binding protein 1 8.6 

50 102546 U57877 Hs.3577 sucdnate dehydrogenase complex; subuni 4.3 

102549 U58046 Hs.l98899 eukaryotic translation initiation factor 3; s 6.3 

102557 U58766 Hs,264428 tissue specific transplantation antigen P35 5 

102562 U59309 Hs.75553 fumarate hydratase 6 
102568 U59877 Hs.223025 RAB31; member RAS oncogene family 9.1 

55 102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatid 7.9 
102581 U61145 Hs.77256 enhancer of zeste (Drosophila) homolog 2 7.6 

102590 U62136 HsJ9300 Homo sapiens enterocyte differentiation a 7 

102591 U62325 Hs,324125 amyloid beta (A4) precursor protein-bindi 4 

102592 U62389 Hs,11223 Human putative cytosolic NADP-<iepende 5 
60 102617 U65928 Hs.198767 Jun activation domain binding protein 6.1 

102618 U65932 Hs.8107l extracellular matrix protein 1 232 
102638 U67319 Hs.9216 caspase 7; apoptosis-relaled cysteine prat 8.9 
102663 U70322 Hs.168075 karyopherin (importin) beta 2 7,1 
102666 U7Q66Q Hs,279BiO ATX1 lantioxidanl protein 1; yeast) homo 4,7 
65 102679 U72661 Hs.11342 ninjurin 1; neive injury-induced protein-1 4,7 
102687 U73379 Hs.93002 ubiquHin carrier protein E2-C 7.7 
102704 U76638 Hs.54089 BRCA1 associated RING domain 1 5.6 
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102705 U77180 


H530002 


small indudble cytokine subfamily A (Cy 


11.8 


102721 U7924i 


Hs.1 18666 


Human done 23759 mRNA; partial ods 


15 


102729 U79254 


Hs:i81311 


asparaginyMRNA synthetase 


5 


102739 


U79282 


Hs.155572 


Human clone 23801 mRNA sequence 


6 


102742 U79293 


Hs.159264 


Human done 23948 mRNA sequence 


13.1 


102761 


1 I00 4 0A 

Uo21d0 


Hs. 118910 


tumor susceptibility gene 1 01 


7 


10Z78O 


1 loccno 
UOOOO2 


Hs.74407 


nudeolar protein p40 


4.1 


102790 


U87269 


Hs.1 54 196 


E4F transcription factor 1 . 


7.1 


102801 


U89605 


Hs.38041 


pyridoxai (pyridoxine; vitamin 66) kinase 


4.7 


102808 


U90426 


Hs.179606 


nudear RNA helicase; DECD variant of D 


7.5 


102817 


U90904 


Hs.83724 


Human done 23773 mRNA sequence 


15.2 


102823 


U90914 


Hs.5057 


cari)oxypeptidase D 


6.6 


102827 


Uyl327 


Hs.6456 


chaperonln confining TCP1; subunit 2 (b 


6 


102838 


U94592 


Hs.80658 


Human uncoupling protein tiomolog (UCP 


6.1 


102841 


U95006 


Hs.37616 


Human D9 splice variant 8 mRNA; comp 


4.2 


102844 


U96113 


Hs 324275 


Homo sapiens Nedd*4^ike ubiquitin-prot 


6.8 


102868 


X02419 


Hs.77274 


plasminogen activator, urokinase 


4 


102907 


X06985 


Hs.202833 


heme oxygenase (decycUng) 1 


22.7 


102919 


XI 2447 




aldolase A; fructose-bisphosphate 


9,9 


102929 


X13238 


Hs.74649 


cytochrome c oxidase subunit Vic 


5.4 


102973 


XI 6663 


Hs 14601 


hematopoistic celKspadfic Lyn substrate 


4.8 


102983 


XI 7620 


Hs 118638 


non-metastafic cells 1 \ prot^ (NM23A) 


4.6 


102985 


XI 7644 


H« 9707 


d to S nhase transition 1 


20.6 


103003 


X52003 


Hs 14QB 


trefoil factor 1 (breast cancer; estrogervind 


10.7 


103018 


X53296 


Hs 81134 


inferleukin 1 receptor antagonist 


5.8 


103023 


X53793 


Hs 117950 


muttifunctional DoIvDentkie similar to SA 


4 


103036 


X54925 


Hs 83169 


matrix metatloDroteinase 1 (tnterstHlal col 


7.3 


103060 


X57766 


Hs 155324 


matrix metalloproteinase 1 1 (stromelysin 


17.8 


103073 


X59417 


He 74077 


nmtpasnme /rirc£Qm9* macrooainl subunit 


5.6 


103075 


X5g543 


Hs 2934 


rihonucteofide raduc^e Ml Dofvoeofide 


4,2 


103080 


X59798 


Hs 82932 


cydin D1 (PRAD1: parathyroid adenomat 


6.7 


103094 


X60787 


Hs 296281 


interieuidn enhancer binding fador 1 


5.7 


103105 


X61970 


He 7RQ1'^ 


nrntcasnmp /o/nsofUfi* macrooainVsubunit 


5.8 


103121 


X63679 


Hs.4147 


transincatino chain-associatlno msmbrane 


4.2 


103149 


X66363 


Uo 171fl?4 
no. 1 f 1 o*>*T 


pr!TAIRF nmfpin kinasp 1 


12 


103180 


X69433 


rio.ooof 


i^nritratp dphvdrDQenase 2 fNADP+V mit 


18.9 


103182 


\yy^r\Ct A fx 

X69819 




infprroftHt;ir arihp^inn ninfpriilp 3 
11 llciuciiuiai duiicdiuii iiivjicuuic «j 


10.7 


103188 


X70040 


Hs.2942 


macrophage stimulating 1 receptor (o-met 


4.1 


103191 


X70218 




nmipin nho«;nhatase 4 fformeriv X^' catalv 


10.7 


103193 


X70476 


Hs 7S794 


matnmpr omtein comolex* subunit beta 2 


8.2 


103194 


X70649 


Ue 7A<;ro 
nd</ouov/ 


DEAD/H f AsD-G!u-Ala-AsD/His> box Do! 


13.7 


103195 


X70940 


Hs.2642 


eukaryotic translation elongation factor 1 


13.4 


103206 


X72755 


Ue 77^fi7 
naif f oQi 


mnnnkinp indiicpd hv nannma interferon 


15.1 


103207 


X72790 




Human enrioaenous retrovirus mRNA for 


5.3 


103208 


X72841 


Hs 31'?14 


rptinoblastoma-bindino orotein 7 


12.3 


103216 


X74262 


Hs 1600'^ 


rpfinohlasfnma-bnidina orotein 4 


4.1 


103226 


X75042 


Hs 44*^13 


v>m( avian ratictiloendofheliosis viral onco 


6.9 




Y7COR1 . 


Hs 74637 


testis enhanced cene transcriot 


7.9 


103262 


X78565 


Hs2B9114 


h^abrachion ftaiasdn C: cvtotactin) 


5 


103278 


X7g882 


Hs 80680 


lunn leslstance^lated DfDtein 


5.7 


103297 


X81788 


Hs 907& 


immahinp mlnn candnorm tianScriDt 1 


4.6 


103302 


X82103 




mflfrMner nmtehi mmntex* subunit beta 


4.5 


103316 


X83301 


Hs.324728 


SMA5 


7.1 


103330 


X85373 


Hs.77496 


small nudear ribonudeoprotein potypepti 


4 


103349 


X8g059 




serine/threonine kinase 9 


4.7 


103352 


X89398 


Hs.78853 


uracH-DNA gtycosytase 


5.3 


103364 


X90872 


Hs,279929 


SULT1C sulfotransfeiase 


4 


103374 


X91788 


Hs.84974 


chloride channel; nudeotide-sensitive; 1A 


4.2 


103380 


X92396 


Hs.24167 


synaptobrevin-Iike 1 


13.6 


103395 


X94754 


Hs.279946 


methionine-tRNA synthetase 


14.2 


103402 


X95404 


Hs.180370 


cofilin 1 (non-musde) 


4.6 


103410 


X96506 


Hs.295362 


DRI-assodated protein 1 (negative cofact 


8.3 


103420 


X97065 


Hs.173497 


Sec23 (S. cerevisiae) homolog B 


4.9 


103421 


X97074 


Hs.1 19591 


adaptor-related protein complex 2; sigma 


5 


103427 


X97303 


Hs.250655 


H^sapiens mRNA for Ptg-12 protein 


7 


103430 


X97544 


Hs.20716 


translocase of inner mitodiondrial membr 


4.5 


103438 


X98263 


•Hs.152720 


M-phase phosphoprotein 6 


4.5 
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103464 Y00285 
103470 Y00796 
103494 Y08991 
103505 Y09912 
103547 Z14982 
103551 Z15115 
103565 Z22548 
103587 Z29083 

103621 Z47727 

103622 Z48042 
103658 Z74615 
103680 293784 
103772 AA092473 
103774 AA092898 
103821 AA157623 
103835 AA172215 
103886 AA236384 
103890 AA236843 
103892 AA243523 
104054 AA393432 
104115 AA428090 
104136 AA442669 
104147 AA451992 
104173 M475564 
104181 AA479521 
104183 AA480838 
104192 AA486945 
104209 AB000221 
104234 AB002357 
104271 C01687 
104278 C02582 
104307 D52818 
104309 D55869 
104370 H19378 
104446 L44497 
104453 M19169 
104476 N33807 
104558 R56678 
104582 R81003 
104634 M004274 
104638 AA004415 
104658 AA007145 
104667 AA007234 
104675 AA009596 
104767 AA025534 
104785 AA027163 
104791 AA029046 
104804 AA031357 
104807 AA032147 
104837 AA039469 
104849 AA040270 
104867 AA045481 
104884 AA053021 
104906 AA055809 
104919 AA057193 
104921 AA057839 
104926 AA058846 
104938 AA064627 
.104943 M065217 
104957 AA074919 
104961 M076672 
104968 AA084602 
104975 AA086071 

104977 AA088228 

104978 AA06d458 
104987 AA101723 
105002 AA113266 



Hs.76473 

Hs.174103 

Hs.83050 

Hs.33102 

Hs.180062 

Hs.75248 

Hs.1 46354 

Hs-82128 

Hs. 150675 

Hs.278672 

Hs.172928 

Hs.278554 

Hs.92gi8 

Hs.198793 

Hs.93748 

Hs.105737 

Hs.72085 

Hs.239189 

Hs.7100 

Hs.26102 

Hs.268371 

Hs.283037 

Hs.76561 

Hs.283740 

Hs.1 14309 

Hs.21321 

Hs.16530 

Hs.168212 

Hs.7381 

Hs,109253 

Hs.111680 

Hs.284123 

Hs.21851 

Hs.7351 

Hs.123114 

Hs.324275 

Hs.88959 

Hs.325820 

Hs.19151 

Hs.106106 

Hs.27268 

Hs.30098 

Hs.301553 

Hs.8852 

Hs.7942 

Hs.301871 

Hs.31803 

Hs.23296 

HS21126 

Hs.241507 

Hs.225979 

Hs.14511 

Hs.26802 

Hs.25252 

Hs,1508 

Hs.33363 

Hs.318725 

Hs.1 14218 

Hs.10026 

Hs.33905 

Hs^9669 

Hs.50758 

Hs.18272 

Hs.19322 

Hs.11661 

Hs.182704 



Insulin-like growth factor 2 receptor 
integrin: alpha L (antigen CD11A (pi 80); 
phosphatidyfinositol 3-kinase-assodated p 
transcription factor AP-2 beta (activating 
proteasome (pnosome; macropain) subunit 
topoisomerase (DNA) 11 beta (180kD) 
thioredoxin-dependent peroxide reductase 
514 oncofetal trophoblast glycoprotein 
polymerase (RNA) II (ONA directed) pel 
membrane component; chromosome 11; s 
collagen; type I; alpha 1 
Homo sapiens DNA sequence torn PAC 
chromobox homolog 3 (Diosophila HP1 g 
ESTs; Weakly similar to R07G3.8 [C.eleg 
KIAA0750 gene product 
ESTs; Moderately similar to TRANXRIP 
ESTs; Weakly similar to gene 9306 pnotei 
ESTs; Weak^ similar to unknown [S.oere 
ESTs 

hypothetteal protein 
ESTs 

zv68f6.r1 SoaresJotalJetus^Nb2HF8^9w 
ESTs; Highly similar to HSPC039 protein 
ESTs; Weakly similar to finger protein HZ 
ESTs 
ESTs 

Homo sapiens mRNA; cDN A DKFZp564 
small inducible cytokine subfamily A (Cy 
kinesin family member 3B 
ATP synthase; H+ transporting; mitochon 
ESTs; Highly similar to N-temiinal acetyl 
endosulfine alpha 

Homo sapiens mRNA full length insert cD 
Homo sapiens mRNA; cONA DKFZp586 
ESTs 

cystatin SN 
protease; serine; 15 

Human DNA sequence from done 987N2 
serine protease; umbilical endothelium 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564 
ESTs 

ESTs; Moderately similar to llil ALU SU 

ESTs 

ESTs 

ESTs; Moderately similar to cAMP indue 
ESTs; Weakly similar to N-WASP (H.sap 
ESTs 

ESTs: Weakly similar to klAA0299 [H.s 
Homo sapiens mRNA; cDNA DKFZp564 
Human gene from PACs 37M17 and 305B 
SCO (cytochrome oxidase deficient; yeast 
ESTs; Weakly similar to phosphoprotein [ 
ESTs 
ESTs 

DKFZP434N093 protein 

ESTs; Highly similar to CGl-72 protein [H 

ESTs 

ESTs; WeaWy similar to ORF YJL053c [S 

ESTs 

ESTs 

chromosome^ssQciafed polypeptkle C 

ESTs 

ESTs 

ESTs 

ESTs; Moderately similar to alternatively 
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A2 
4.5 
4.1 
4.5 
4.3 
4 

7.6 

14.6 

6.3 

4.4 

5.9 

4.4 

4.9 

6.1 

23.3 

4 

4.9 

7.8 

4.8 

5.3 

28.7 

57 

6.9 

5.2 

7,8 

5.1 

4.3 

12.3 

6.2 

42 

4.5 

47 

4.2 

6.4 

4.9 

11.6 

5.6 

6.3 

13.i6 

6.3 

10.1 

4.3 

16.6 

.4.6 

4.8 

.8.1 

10.9 

5.5 

10.4 

4.6 

4.3 

4.5 

47 

8.8 

5.5 

4.2 

7 

7.1 

5.7 

47 

5.5 

4.3 

8.3 - 

6.2 

6,7 

9.2 

6.9 
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105012 


AA1 16036 


Hs.9329 


chromosome 20 open reading frame 1 


10.7 


105019 


AA121879 


Hs.9280 


proteasome (ptosome; macropain) subunit 


5.7 


105029 


AA126855 


Hs.13268 


ESTs 


4,4 


105033 AA127964 


Hs.274329 


TP53 target gene 1 


6.3 


105035 


AA1 28486 


Hs.8859 


ESTs 


6.5 


105039 


AA1 30349 


Hs.36475 


ESTs 


4 


105062 


AA134968 


Hs.36529 


ESTs 


4.3 


105076 


AA142858 


Hs.37810 


ESTs 


6.4 


105087 


AA147884 


Hs.9812 


ESTs 


9.2 


105091 


AA148859 


Hs.179909 


ESTs: Weakly simOar to till ALU SUBFA 


5.7 


105093 


AA149051 


Hs.32405 


ESTs 


6.3 


105107 


AA152302 


Hs.25035 


DKF2P566G223 protein 


6.2 


105127 


AA158132 


Hs.301957 


ESTs; Weakly similar to contains simDari 


5.7 


105132 


AA159501 


Hs.247280 


HBV associated factor 


4.2 


105143 


AA165333 


Hs.24808 


ESTs 


4.7 


105154 


AA171736 


Hs.35947 


methyt-CpG binding domain piote'n 4 


9 


105162 AA176690 


Hs.4084 


KfAA1025 protein 


9.1 


105186 


AA191512 


Hs^8005 


Homo sapiens mRNA; cDNA DKFZp564 


19.3 


105209 


AA205072 


Hs.227743 


KIAA0980 protein 


7.4 


105223 AA211388 


Hs.7750 


ESTs 


5.1 


105252 AA227428 


Hs.9728 


ESTs; Weakly simitar to KIAA0512 pnDte 


11.1 


105253 AA227448 


Hs.5003 


KIAA0456 protein 


6!4 


105261 


AA227871 


Hs.6361 


MEK partner 1 


9.1 


105263 


AA227926 


Hs.6682 


ESTs 


67 


105274 


AA228122 


Hs.281866 


ATPase; H+ transporting; lysosomal (vacu 


5.3 


105297 


AA233451 


Hs. 183858 


transcriptional intermediary factor 1 


8.7 


105309 


AA233790 


Hs.4104 


ESTs; Weakly similar to cDNA EST yk38 


7.4 


105312 


AA233854 


Hs.23348 


S-phase kinase-associated protein 2 (p45) 


5.8 


105342 


AA235286 


Hs.157078 


ESTs 


4.5 


105376 


AA236559 


Hs.8768 


ESTs; Weakly similar to HI! ALU SUBFA 


5.8 


105386 


AA236950 


Hs.8115 


ESTs 


5.5 


105397 


AA242868 


Hs.7395 


ESTs; Weakly similar to house-keeping p 


7.7 


105399 


AA243007 


Hs. 16420 


ESTs; Highly similar to SH3 domauvbind 


5.6 


105400 


AA243052 


Hs.65648 


RNA binding motif protein 8 


5.8 


105404 


AA243303 


Hs.21187 


ESTs 


9.1 


105409 


AA243562 


Hs.301855 


ESTs 


4.4 


105436 


AA252172 


Hs,237856 


ESTs; Moderately similar to cAMP indue 


5.1 


105483 


AA255874 


Hs.23458 


ESTs 


4.9 


105493 AA256268 


Hs.i0283 


ESTs 


6 


105495 


AA256317 


Hs.28785 


Homo sapiens mRNA; cONA DKFZp588 


5.2 


105496 


AA256323 


Hs.301997 


DKFZP434N126 protein 


8.7 


105500 


AA256485 


Hs.2223g9 


061-96 protein 


9.5 


105507 


AA256678 


Hs.226318 


ESTs; Moderately similar to CCR4-associ 


4.1 


105538 AA258860 


Hs.32597 


ring finger protein (C3H2C3 type) 6 


4.1 


105544 


AA261954 


Hs.24678 


ESTs 


8 


105546 AA262032 


H5.268281 


BSTs; Weakly 5im9ar to 6209^ [D^nelan 


8.1 


105549 


AA262417 


Hs.5415 


ESTs 


4:6 


105551 


AA262477 


Hs.25292 


ribonudease HI; large suburut 


9.1 


105560 


AA262783 


Hs.306915 


ESTs 


4.5 


105565 


AA278302 


Hs.18349 


ESTs; Weakly similar to partial COS [C,e 


4.2 


105566 


AA278323 


Hs.17481 


Homo sapiens done 24606 mRNA sequen 


11.9 


105575 


AA278717 


Hs. 12772 


ESTs 


5.9 


105584 


AA279012 


Hs.3454 


ESTs; Weakly similar to K1AA0665 prote 


4.4 


105596 


AA279418 


Hs.18490 


ESTs 


4 


105604 


AA279787 


Hs.15467 


ESTs; Moderately similar to putative pho 


5.6 


105610 


AA279991 


Hs.99872 


ESTs; Weakly simBar to trithorax homolo 


5.3 


105621 


AA280865 


Hs.6375 


Homo sapiens mRNA; cDNA DKFZp564 


4.8 


105627 


AA281245 


Hs.23317 


ESTs 


7.5 


105538 


AA281599 


Hs.247817 


Homo sapiens mRNA for for histone H2B 


5.9 


105645 


AA282138 


Hs.11325 


ESTs 


6.4 


105650 


AA282347 


Hs.25635 


ESTs; Highly similar to HSPC003 [H.sap 


11.3 


105666 AA283930 


Hs.34906 


ESTs 


4.7 


105674 


AA284755 


Hs.279789 


CDW52 antigen (CAMPATH-1 antigen) 


8 


105687 


AA286809 


Hs.28423 


ESTs 


7.1 


105700 


AA287643 


Hs.35254 


ESTs; Weakly simSar to hypotiietical pro 


4.9 


105705 


AA2S0767 


HS.1D1282 


Homo sapiens mRNA; cONA OKFZp434 


8 


105709 AA291268 


Hs.26761 


OKFZP586L0724 protein 


6.8 
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105731 AA2927i1 ns.2913i 


toTS 


ft A 
D.l 


105753 AA299789 Hs, 110857 


ESTs 


7 


105774 AA348014 Hs.23412 


ESTs 


/.I 


105784 AA350771 Hs. 17850 


ESTs 




105791 AA358036 Hs. 14368 


SH3-bind(ng doman glutamic actd-rich p 


4.0 


105807 AA393803 Hs.16869 


ESTs; Moderately similar to COLLAGEN 


5.3 


105808 AA393808 Hs.286131 


KIAA0438 gene product 


A A. 
4.1 


105812 AA394126 Hs^0814 


ESTs; Highly similar to CGI-27 protein [H 


AA C 


105813 AA394140 Hs.18585 


ESTs 


4.9 


105619 AA397920 Hs.28783 


Homo sapiens mRNA; cONA DKFZp564 


A O 

4,S 


105870 AA399623 Hs.101067 


ESTs 




105874 AA400074 Hs.171118 


ESTs 


4 


105896 AA400999 Hs.7838 


Human ring zinc-finger protein (ZNF127- 


A Q 

4.8 


105934 AA404248 Hs,16577 


ESTs 


5.2 


105935 AA404277 .Hs.263727 


ESTs; Weakly similar to bisphosphate 3*- 


4 


105966 AA406105 Hs.5344 


adaptor-related protein complex 1; gamma 


8.3 


105974 AA406321 Hs.6224 


KIAA0895 protein 


4.6 


105990 AA4 10336 Hs.29403 


ESTs; Weakly similar to PROBABLE AT 


4.5 


105995 AA410510 Hs.5345 


ESTs 


4.9 


106000 AA410972 Hs.20725 


ESTs 


5.8 


106007 AA411462 Hs.11042 


ESTs; Weakly similar to veli 1 [H.sapiens 


c n 
0,9 


106016 AA411819 Hs.8164 


KIAA089d protein 


5 


106034 AA412473 Hs. 14928 


ESTs 




105042 AA412700 Hs.169895 


ubiquitin-oonjugating enzyme E2L 6 


4.6 


106057 AA417067 Hs.289074 


ESTs 


4.5 


106065 AA417558 Hs.25206 


ESTs 


12.3 


106070 AA417761 Hs.5957 


Homo sapiens clone 24416 mRNA sequen 


5 


106103 AA421104 Hs.12094 


ESTs 


15.4 


106126 AA4240a6 Hs.22972 


ESTs; Moderately similar to H5AR tM.m 


6.4 


106154 AA425304 Hs.6994 


ESTs 


5.1 


106157 AA425367 Hs.34892 


ESTs 


11.1 


106166 AA425872 Hs.19561 


NADH dehydrogenase (ubiquinone) 1 alp 


19.3 


106204 AA428024 Hs.21479 


ESTs 


4.7 


106210 AA428239 Hs.10338 


ESTs 


5.7 


106220 AA428582 Hs.32196 


ESTs; Moderately similar to metargidin p 


7.7 


108236 AA429951 Hs.21104 


ESTs 


8 


106240 AA430074 Hs.18552 


ESTs; Weakly simitar to Ylr218cp [S.cere 


A A 

4.4 


106263 AA431462 Hs.28329 


ESTs 


4.9 


106288 AA435536 Hs.24336 


ESTs 


8.8 


106293 AA435591 Hs.301444 


signal sequence receptor; gamma (transloc 


8.7 


106310 AA436244 Hs.17240 


ESTs 


4.5 


106317 AA436568 Hs. 1081 24 


ESTs 


4 


106328 AA436705 Hs.28020 


KIAA0766 gene product 


A A 

4.4 


106341 AA441798 Hs.5243 


ESTs; Moderately similar to p]L2 hypoth 


23.7 


106348 AA442253 Hs.10702 


ESTs 


4,7 


106350 AA442763 Hs.194698 


cidi\t\ B2 


6.1 


106371 AA443923 Hs.170310 


ESTs 


6.8 


106389 AA446949 Hs.6236 


ESTs 


4.7 


106394 AA447223 Hs.25320 


Homo sapiens clone 25142 mRNA sequen 


A A 

4.4 


106426 AA448282 Hs. 16206 


ESTs; Weakly similar to F55C12.5 [C.ele 


4.5 


106459 AA449741 Hs.4029 


gRoma-amplIfied sequence-41 


A D 

4.0 


106462 AA449912 Hs.30532 


ESTs; Highly similar to CGl-77 protein [H 


5.2 


106468 AA450047 Hs. 14770 


ESTs 


o.o 


106479 AA450351 Hs.75251 


ESTs 


12.4 


106494 AA452108 Hs.18387 


transcription factor AP'2 alpha (activating 


4.5 


106503 AA452411 Hs.29679 


ESTs; Highly similar to mediator (H.sapie 


5.1 


106507 AA452584 Hs.267819 


protein phosphatase 1; regulator (inhibito 


4.9 


106533 AA453786 Hs.145998 


ESTs 


8.3 


106568 M455970 Hs.28285 


patched related protein translocated in ren 


7.6 


106586 AA456598 Hs.57767 


ESTs 


8.2 


106589 AA456646 Hs.28661 


ESTs 


4.8 


106606 AA457730 Hs.283437 


Homo sapiens done 23851 mRNA sequen 


4.4 


106611 AM58g04 Hs.26267 


ESTs; Weakly similar to torsinA [H.sapie 


7 


106614 AA458934 Hs.256150 


ESTs 


4.5 


106628 AA459657 Hs.12311 


Homo done 23570 mRNA sequen 


6.5 


106637 AA459951 Hs.250824 


ESTs 


5.5 


106644 AA460239 Hs.12680 


ESTs 


4.4 



221 



wo 02/059377 PCT/US02/02242 



106564 AA460969 Hs.7510 mitogen-acfivated protein kinase kinase ki 

106698 AA463745 Hs.29403 ESTs: Weakly similar to PROBABLE AT 

106719 AA465171 Hs.236844 ESTs 

106726 AA465339 Hs.3886 ESTs 

5 106747 AA476473 Hs.171957 triple functional domain (PTPRF interacli 

106759 AA477263 Hs.25584 ESTs 

106765 AA477717 Hs.306117 interfeukin 13 recepton alpha 1 

106784 AA478558 Hs,227913 APl5-fike1 

106831 AA482014 Hs29463 centrin; EF-hand protein; 3 (CDC31 yeast 

10 106836 AA482112 Hs.238707 ESTs 

106840 AA482548 Hs.5534 ESTs 

106856 AA486183 Hs.285123 ESTs; Weakly similar to sMar to oxyste 

106855 AA487228 Hs.19479 ESTs 

106878 AA488872 Hs.12314 Homo sapiens mRNA; cONA DKFZp586 ' 

15 106888 AA489101 Hs.24734 oxyslerol binding protein 

106895 AA489665 Hs,25245 ESTs 

106909 AA490323 Hs.250747 SUMO-1 activating enzyme subunit 1 

106919 AA490885 Hs.21766 ESTs 

106920 AA490899 Hs.296323 ESTs 
20 106941 AA495204 Hs.237971 ESTs 

106942 AA496347 Hs.31314 retinoblastoma-binding protein 7 

106948 AA496788 Hs.21077 KIAA0532 protein 

106968 AA504631 Hs.26813 ESTs; Weakly similar to hypothetical 43.2 

106973 AA505141 Hs.11923 Human DMA sequence from cfone 167A1 

25 105980 AA521121 Hs.8858 bromodomain adjacent to zinc finger dom 

108981 AA521157 Hs.74101 ESTs 

105998 AA598461 Hs.195464 insulin-like growth foctor binding protein 

107008 AA598710 Hs.23740 ESTs 

107028 AA599214 Hs.24143 ESTs 

30 107032 AA599472 Hs.247369 succinate-CoA ligase; GDP-fomiing; beta 

107052 AA600134 Hs. 12482 glyceranephosphate O-acyftransferase 

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH^ytochro 
107056 AA600310 Hs. 18720 programmed cell death 8 (apoptosis-induc 
107080 AA609210 Hs.19221 ESTs 

35 107102 AA609723 Hs.30652 ESTs 

107109 AA609943 Hs.32793 ESTs 

107129 AA620553 Hs.4756 flap structure-specffic endonudease 1 

107132 AA620598 Hs.9052 ESTs 

107136 AA620795 Hs.8207 ESTs 

40 107140 AA620a89 Hs.170088 ESTs 

107151 AA621169 Hs.8687 ESTs 

107159 AA621340 Hs.10600 ESTs; Weakly similar to ORFYKR081c[ 

107174 AA621714 Hs.25338 ESTs 

107217 D51095 Hs.35861 DKFZP586E1 621 protein 

45 107252 D59971 Hs,25925 ESTs 

107295 T34527 Hs.80120 UDP-N-acetyl-aIpha-D-galactosamine:po 

107299 T40327 Hs.30661 lung resistance-related protein 

107324 T81665 Hs,276422 DKFZP586G 11 22 protein 

107372 U85625 Hs.8297 ribonudease 6 precursor 

50 107373 U85773 Hs.154695 phosphomannomutase 2 

107481 W58247 Hs279766 Homo sapiens kmesin superfamily motor 

107531 Y13936 Hs.17883 protein phosphatase 1G(fomrieriy2C}; ma 

107859 AA024835 Hs.47584 potassium voltage-gated channel; defayed 

107890 AAQ26030 Hs.61311 ESTs; Weakly similar to CALPAIN 2; LA 

55 107908 AA026894 Hs.42826 ESTs 

108039 AA041341 Hs.46670 ESTs 

108040 AA041551 Hs.159971 ESTs 

108102 AA046424 Hs.49433 ESTs; WeaWy similar to HYPOTHETICA 

108217 AA058686 Hs.62588 ESTs 

60 108255 AA063157 Hs.172608 ESTs 

108358 AA071514 Hs.1634 ESTs 

108609 AA100694 Hs.69499 Human DNA sequence from BAG 15E1 o 

108647 AA1 12396 Hs.44276 ESTs; Moderately similar to HOMEOBO 

108676 AA1 15562 Hs.274417 Homo sapiens mRNA; cDNA DKFZp564 

65 108687 AA120785 Hs.54347 ESTs 

108695 AA121315 Hs.70823 KIAA 1077 protein 

108733 AA126422 zn84f1 .si Stratagene lung carcinoma 9372 
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108774 
108828 
108872 
108884 
108893 
109008 
109010 
109011 
109042 
109086 
109090 
109101 
109112 
109160 
109165 
109178 
109179 
109261 
109270 
109277 
109313 
109415 
109454 
109467 
109481 
109508 
109514 
109572 
109632 
109644. 
109703 
109726 
109747 
109799 
109814 
110189 
110240 
110280 
110520 
110561 
110707 
110734 
110770 
110780 
110787 
110794 
110799 
110818 
110839 
110844 
110854 
110856 
110860 
110897 
* 110915 
110935 
110970 
111006 
111008 
111018 
111084 
111100 
111125 
111132 
111139 
111164 
111172 



AA128125 
AA131584 
AA1 34063 
AA134958 
AA135894 
AA156360 
AA156460 
Ml 56542 
AA1 59525 
AA165695 
AA167006 
AA167708 
AA169379 
AA1 79387 
AA179845 
AA181600 
AA181902 
AA195255 
AA195515 
AAig6332 
AA206800 
AA227219 
AA232255 
AA232904 
AA233342 
AA233892 
AA234087 
F02027 ■ 
F04165 
F04477 
F09684 
F10009 
F10161 
F10770 
F10979 
H20543 
H25577 
H29285 
H56965 
H59617 
H95079 
H98714 
N22262 
N23174 
N24716 
N25262 
N26101 
N29454 
N30856 
N31952 
N32919 
N33063 
N33438 
N39148 
N46252 
N48787 
N51374 
N53375 
fsJ53388 
N54067 
N59543 
N62522 
N63823 
N64378 
N64683 
N66857 
N67102 



HsJI 040 ESTs; Moderately similar to CEa GROW 

Hs.273344 DKFZP564O0463 protein 

Hs.111680 ESTs 

Hs.293591 ESTs 

Hs.l 94691 retinolc add induced 3 

Hs.87128 ESTs 

Hs.44229 dual spedfici^ phosphatase 12 

Hs.72127 ESTs 

Hs.7 1 779 Homo sapiens DNA from chromosome 19 

Hs.270737 tumor necrosis factor (ligand) superfamity 

Hs.70499 ESTs 

Hs.52184 ESTs 

Hs.257924 ESTs 

Hs.301997 DKF2P434N126 protein 

Hs.73625 RAB6 interacting; Idnesin-like (rabkinesin 

Hs.283707 ESTs 

Hs.192789 ESTs; Weakly similar to !!!! ALU SUBFA 

Hs.61779 ESTs 

Hs.3585 ESTs; Weakly similar to alternatively spti 

Hs.86043 ESTs 

Hs.86276 ESTs; Moderately similar to zinc finger p 

Hs, 1 10826 trinucleotide repeat containing 9 

Hs.295232 ESTs 

Hs.63187 ESTs 

Hs.289059 ESTs; Weakly similar to WD40 protein C 

Hs.55902 ESTs; Weakly similar to !!!! ALU SUBFA 

Hs.262346 ESTs; Weakly similar to 0RF2: function 

Hs.171937 ESTs 

Hs.235873 ESTs; Weakly similar to K1 1C4.2 [C.eleg 

Hs.291531 ESTs; Moderately similar to GLYCERAL 

Hs.24792 ESTs; Weakly similar to ORF YOR283w 

Hs.9196 ESTs 

Hs.22969 ESTs 

Hs.l 80378 Homo sapiens clone 669 unknown mRNA 

Hs.153106 Homo sapiens clone 23728 mRNA sequen 

Hs.6278 DKFZP586B1621 protein 

Hs.176588 ESTs; Weakly similar to CYTOCHROME 

Hs.32468 ESTs 

Hs.4082 yiO9f06.s1 Scares fetal liver spleen 1NFL 

Hs.5199 ESTs; Weakly similar to UBIQUITIN^O 

Hs.15617 ESTs; Weakly similar to IL» ALU SUBFA 

Hs,24131 ESTs 

Hs.131705 ESTs 

Hs.22891 solute carrier family 7 (cattonic amino ad 

Hs.12244 ESTs; Weakly simflar to C44B9.1[C.eteg 

HS.27931 ESTs 

Hs323401 Human ring zinc-finger protein (ZNF127- 
Hs^7S52 ESTs; Weakly similar to putative p150 [H 

Hs.30246 solute earner family 19 (thiamine transpo 
Hs.167531 Homo sapiens mRNA fuH length insert cD 
Hs.27931 ESTs 

ESTs; Weakly similar to S164 [H.sapiens 
Hs.170065 ESTs 
Hs.6880 DKFZP434D1 56 protein 
Hs.29724 ESTs 

Hs.305979 protease inhibitor 1 (anti-elastese); alpha- 
Hs.96870 Homo sapiens mRNA full length insert cO 
Hs.l 66146 Homer, neuronal immediate earty gene; 3 
Hs.7222 ESTs 

Hs.3628 mitogen-activated protein kinase kinase ki 
Hs.15456 PDZ domain containing 1 
Hs^0450 ESTs 
Hs.269115 ESTs 

Hs.83293 ESTs; Highly similar to unknown (unctk)n 
Hs.290943 ESTs 

Hs.14808 ESTs; Weakfy similar to HI! ALU CUSS 
HS.21&51 Homo sapiens mRNA; cDNA DKFZp586 
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111178 


N67227 


Hs.24633 


ESTs 


5.7 


111179 


N67239 


Hs.10760 


ESTs 


37 


11181 


N67278 


Hs.171802 


ESTs; Weakly similar to h/potheficaf pro 


5.6 


111184 


N67437 


Hs.243901 


Homo sapiens mRNA; cDNA DKFZp564 


8.7 


111221 


N68869 


Hs.15119 


ESTs 


7.3 


11223 


N68921 


Hs.297939 


ESTs; Weakly sim9ar to neogenfli [H.sap 


9 


11229 


N69113 


Hs. 110855 


ESTs 


8,9 


11241 


N69514 


HS.28868G 


ESTs; Weakly similar to CGI-82 protein I 


6.9 


11258 


N70481 


Hs.26118 


Homo'sapiens done 24766 mRNA sequen 


4.5 


11295 


N73275 


Hs.21275 


ESTs* Weaktv ^m3ar io ubiouiiin-f^miijfl 


5.6 


11299 


N73808 


Hs.24936 


ESTs 


8.5 


11336 


N79565 


Hs.29894 


ESTs 


6,7 


11357 


N91023 


Hs.87128 


ESTs 


15 


11370 


N92915 


Hs.94631 


brefeldin AHnhibitad gusuiine nudeotide-e 


5.2 


11806 


R33468 


Hs 279008 


ESTs 


10 


11825 


R35885 


Hs.286148 


stromal an^en 1 


4.5 


11836 


R36228 


Hs 25119 
1 Ilk/ 


ESTs 


7 2 


11890 


R38678 


Hs. 12365 


ESTs 


17,3 


11923 


R39995 


Hs 25925 


Homo saoiens clone ?3880 mRNA spniiPn 


7.3 


11942 


R40576 


Hs.21590 


ESTs 


9.2 


11987 


R42035 


Hs.6763 


K1AA0942 protein 


10.6 


12101 


R44793 




pHpfivlvl rvHa<u>^<«cnri9tAH nmtpin 9 
auciiyiji wjwaoc^asd^^a usu |jiuiciii £ 




12134 


R46025 


Hs.7413 


ESTs 


17.4 


12197 


R49482 


Hs.5637 


ESTs 


4.4 


12244 


R513Q9 


Hs 70823 

1 r VW&W 


KIAA1077 Drotein 


11 


12253 


R51616 




Homo saolens mRMA* cDNA DKFToSfifi 


9.3 


12305 


R54822 


Hs.26244 


ESTs 


4.4 


12449 


R63802 


Hs 124186 


rinn finn^r nmtpin 9 


6*^ 

u.o 


12483 










112519 


R68631 


Hs 11861 


ESTs 


14.3 


12610 


R79392 


Hs.23643 


ESTs 


5.2 


12693 


R88741 


Hs 91065 


FST<i' MoHpratpIv <;imil;4r tn nrnlifprafirwi 


4.6 


12751 


R93507 


Hs.8207 


ESTs 


5.6 


12801 


R97486 


Hs. 157160 


Drntpin kirir^^p* nNA-artiuatpri* ratah/fif! n 


8.7 


12869 


T03313 


Hs.4747 


dvQkpralnci^ mnnonits 1 * Hxickprin ■ 


5.9 


12871 




rid. 1 ^zLOO 




iJ.O 


12908 


T1DD65 


Hq 3*530 






12966 


T17119 


Hs 102548 


aliinnnnrtlf^iri rprpntnr HNA htnriinn fsrf 


5.7 


12971 


T17185 


Hs 83883 


ESTs 


V.H 


12995 


T23528 


no./ tuij 


PQTc* \A/P9trlu eimnar tn TVl^l nmtoin t\A 
COiO| VvcanJy ouliUai 11/ 1 7 AI f^fUlCMl [IVJ 


Q 1 


13047 


T25867 


■ Hs.7549 


CO 1 o 


DM 


13075 




Hq RQR8 


FC:t<:* Wpalfix/ cimilar tn fMl Ai 1 1 <il IRF& 
CO id, vvcaiviy olliiuar Ivi ll" MLU OUDrM 




13117 


T47819 


Hs.159153 


ESTs 


5.8 


13206 


T58044 


Hs.241471 


ESTs' Moderaielv simitar to UH ALU SU 

1 W) iviuudGii^ij ^iiiiiiai lu ills r^^w vJU 


6.4 


13248 


T63857 






4.6 


13260 


T64896 


Hs.287420 


ESTs 


6.9 


13277 


T65797 


Hs. 11774 






13278 


T65802 






A 


13440 


T86121 


Hs.191445 


ESTs 


6.4 


13523 


T9D037 






R A 


13604 


T92735 


Hs 296083 


ESTs 


8.7 


13702 


797307 




ESTs* Mnripratplv similar tn III) At ti SI J 


9.5 


13783 


W19222 


Hs.7041 


ESTs; Weakly similar to ALU SUBFA 




13794 


W37382 


Hs.11090 


ESTs 


11.9 


13808 


W44735 


Hs.9286 


ESTs 


16.7 


13811 


W44928 


Hs.6994 


ESTs 


4 


13822 


W47350 


Hs.17466 


retinoic acid receptor responder (tazaroten 


4.8 


13823 


W47388 


Hs.55099 


rab6 GTPase activating protein (GAP and 


4 


13836 


W56792 


Hs.12040 


ESTs; Weakly similar to KIAA0881 pnote 


4.1 


13857 


W65477 


Hs.5297 


Homo sapiens mRNA; cDNA DKFZp564 


4.3 


13886 


W72471 


Hs.23920 


ESTs 


4.6 


13895 


W73738 


Hs.12921 


ESTs 


7.1 


13923 


W80763 


Hs,3849 


ESTs; Weakly similar to FK505-binding p 


6.8 


13931 


W81205 


Hs,3496 


ESTs 


6.1 


13950 


W85765 


Hs.30504 


Homo sapierte mRNA; cONA DKFZp434 


14 


13970 


W86748 


Hs.8ia9 


ESTs 


15 


14051 


W94942 


Hs.177534 


dual specifidty phosphatase 10 


5.4 
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Hs.34192 


ESTs 


4.8 


ll^nRR 7%9RR 




Mnntn Qflnk>rv% PAH rtonp n.tn777n9^ frn 


5.1 


IliinQA 7^fl^7' 


no. 1 lOwiJO 


PfsTc* lA/oakhf ctmifar in cimilar in Q ct^m 


6.2 


1 1^ IW CMytOa 


Ms IfldinA 

flo. 10*t lUO 


nhncAinal nmtain 1 91 
ituuauiiioi piuunii 1 


4.6 


1 IH left 


Ue 195010 


PSTq* HIahlv similar to KIAA088R nrote 


22 


1 l*t lOO £^30(00 


na. 19/ Hif 




8.8 




Ue 971QR 


F<5T« 


4 






Cd 1 S • 


7 9 




Lie otiAn 


cor 5 




1141 yo ^i}9i211 


Ue K^nooA 

nS. 100920 


^fe/%eA 4 M^AenkQ^A m l^ntiKilfranefArQeA 

rucosc-i'pnTOpnaiie guanyiyiuansisrasc 


A A 


liA^UO ^oiJoUl 


ns.f ooy 


CO IS 




'l'1>IOCn ■ 7*50007 


ns. i^2y/ 


CQTe 
COlS 


7 0 


I'lVIOC'l 7^QDQQ 

114Z31 ^OijQ^O 


Ue OIQvlfl 

ns.2iy4o 


Co 1 S 


\A 7 


M44y2 ^*1071& 


ns.1o4o41 


d6tt3*6 fstty dcid d6saturas6 


1Q A 


-nyiOQ? 7iin7Cft 

n4^y/ A4U/t)o 


Ue 47m04 






^4134/: 


Ulf« OOQ/*4 

ns.22y4i 


CO IS 


1*^ 7 


114460 AAU24004 


nS.2Dl02 


CO 1 S 


in 1 


4'4>4>i7<i A Anion? 
1 1 447 1 AA02o074 


11 „ 4nAC4 0 

nS.1U461o 


CCTe 

CblS 


R 7 
O.f 


114400 AA0j224j 


Ue 4*;4C7ft 

nS.lOlD/o 


u ur-iN-aceqf i-aipnaHJ-y aiaciDsainin^ 


7 1 


114510 PAQAoAQJ 


nS. 105469 


suppressor of vaii (S.cerevisiae) 3-like 1 


A 0 
4.0 


4AACAO AAnCC7CQ 

114042 AAOoofDO 


ij_ ooooon 
nS.2y3ooO 


COTe 

CO IS 


11 7 
11./ 


114549 AA05d4o4 


Llf> OOOQOO 

nS.292oJo 


COTe 

CO IS 


7 0 


114552 AA10i41d 


nS.lO/ 149 


Co 1 S, WeaKly Slrnnar 10 r i b-AooUOIA i 


0.1 


114b7o AAliooOo 


LI- OCCDO 


b'snsmenfibrane 4 supsrfamtiy member (te 


A 


n4oyo /vMiiioyoi 


Uc 1 inR<;7 


COTe* UlnhKf etmilorfrv niitafi\ia r^MA.^ir 

coi b, niyniy diiniiai lu puiauvc uixrV'Uii 


7 1 


AAAtCy AA4>IQOOC 

11476/ AA140O0O 


11-. 4C>I>IJ|0 

ns. 15444 J 


nriinichrofnosomB iralntenance defident (S 


0.0 


AAA 700 A A 4 CQ'lOO 

114 f yy AAi 5y 02 J 


Ue 4nQQOQ 

ns.iuyy2y 


COTe 

col S 


A 0 


114oD4 AAlQiJ^O 


ns.2oyyob 


CCTe 
CO IS 


A A 


AAAQA4 AA4C44e4 

114011 AAlbllOl 


Ue QKOfY7 


fTiultiplB inositol polyphosphate phosphata 


7 1 
/.J 


1 1 4o21 AAlbdoU 


Ue l«(«ARB 


COTe 
CO IS 


A A 


114002 nA2^0ll«10 


Ue lAOCn 


CCTe. hiInHAP94AKf elmilor (a iiKimtMin ena 

CO 1 s, iviOQciaieiy Similar lo uuiquiiin spe 




1 1 43UI /\A2i3o2/D 


Ue 4Qf5il^7 

rlS.iyo4o/ 


CO 1 s, weaKiy sinruiar lo r\2ooDU_i . pani 


1R Q 


114302 AA2*lDO0y 


Ue ^Qc:nA 


CCTe 

cols 


ij. 1 


AAAdAfX AAOitm<f^ 

114940 AA24ju12 


nS.7oyZo 


CCTe 

cols 


0.0 


114900 NK£SMfo( 


Ue 70il70 

nS./ 24f 2 


CCTe 
Co IS 


09. 1 


AASit\At AA1C0C07 

lloU4/ f\fi2£3Zn£.i 


nS.o291o 


homeo box B5 


<^ 7 

0.1 


110054 AA252o5J 


Llo fl7700 

nS.O/ 729 


CCTe 
CO IS 




AACnOA AAOC0O47 

115061 AA250217 


11 _ 44074 
HS.41271 


COTe 

cols 


11 


AACnOO AAOCCCC7 

115082 AA25555/ 


Llo 4oaocQ 
nS. 196269 


NADH dehydrogsnase (ubiquinone) 1 alp 




IIOilD AA2oo4ob 


Ue C007K 


COTe 

coiS 


A A 
0.0 


115140 AA25oOoU 


nS.2/99oo 


ESTs; Weaidy similar to supported by GE 


A 1 


A A cmc A A OCOyl7rt 

1 15205 AA2d24/0 


Ue 0D>I04C 

nS.2c>421b 


COTe 

cols 


A 

0.0 


1 10200 «A20^:4yi 


Ue 4 0c;q70 


PCTc 
Colo 


5 1 


A A COOA A A OTOCCA 

115239 AA27oo50 


Ue 70004 

HS. 73291 


coTs, weaKiy siiniiar to simdar to trie bet 


A a 


4 4COilO A A07Q7CC 

115242 AA27o755 


Lie 0Q070O 


COTe 

coTS 


A 

0.0 


<4 4COAn AA07D0C4 

115249 AA2/o9d» 


Ue 71 4 Oil 

MS./ 1124 


COTe 

coTS 


in 1 


A AC*^Cf\ A AnftA7H 

115259 AA27907I 


Ue 4 0>ICO 

nS.13453 


spitDng factor 00, subunit 1; iookD 


y.o 


AACiOC A A 070700 

115255 flJ\Z/iif\)iS 


Ue 00070C 


noTe 
coiS 


'v A 
0.0 


AACnnA AA07QOjI'J 

115291 AA27994J 


Ue iOOC7Q 

ns.i220# y 


CCTe 
CO IS 


f\ 1 

O.l 


<44COC7 A A 00 4 

115357 AA2ol79o 


Ue 700flfl 

nS./29o6 


• COTe 

cbTS 


e 
0 


44C077 AAOOOOi*? 
115377 fiAZO^HI 


Ue 4QOnC9 

ns.iyoooo 


CCTe 

CO IS 


0.1 


A AC Ann A A 000 4 no 
115400 AA2o319o 


Ue 004 40 


COTe 

cols 


^ Q 


AAG.A*iCi AAOO>ICC1 
1 104 jy AA2o40Ol 


Ue 10*10011 


Col S 


s ft 
0.0 


115471 AA287138 


Hs.59346 


ESTs; Weakly similar to ASPARTYL-TR 


11.7 


115506 .AA292537 


Hs.45207 


IHuman DNA sequence from done 620E1 


6,8 


115522 AA331393 


Hs.47378 . 


ESTs 


5.8 


115572 AA398392 


Hs.:;9594 


ESTs; Weakly simHar to FSSOI "^ : gene p 


9.7 


115587 AA399264 


Hs.253037 


ESTs; Highly similar to HSPCO: ■ . rotein 


8.7 


115600 AA400247 


Hs.42173 


ESTs 


4 


115612 AA40094d 


Hs.71243 


ESTs; Weakly similar to zinc firMT^*^ piote 


8.4 


115646 AA404352 


Hs.305971 


ESTs 


5.3 


115652 AA405098 


Hs.38178 


ESTs 


16.1 


115657 AA405620 


Hs.65158 


ESTs; Weakly similar to weak similarity t 


4.7 


115658 AA405625 


Hs.183056 


Human DNA sequence from done 34B21 


5.1 


115675 AA406546 


Hs.82085 


Homo sapiens mRNA; cDNA OKFZp564 


20.5 


115721 AA417102 


HS.S0960 


ESTs 


4.8 


115763 AA42156d 




ESTs 


7 
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115764 


AA421562 


Hs.91011 


anterior gradient 2 pCenepus laevis) homo 


41.6 


115835 


AA428576 


Hs.41371 


ESTs 


4.2 


115844 


AA430124 


Hs.7773 


ESTs 


11.9 


115875 


AA433943 


Hs.43946 


ESTs; Weakly similar to Weak similarity 


33.5 


115888 


AA435839 


Hs.76591 


KIAA0887 protein 


7.2 


115922 


M441911 


Hs.71869 


ESTs; Weakly similar to K1AA0926 prote 


5.1 


115941 


AA443602 


Hs.46679 


ESTs 


4.8 


115947 


AA443793 


Hs.94761 


ESTs 


8.3 


115948 


AA443798 


Hs.43445 


poly(A}-specinc ribonudease (deadenytat 


13.5 


115951 


AA443918 


Hs.301048 


cofiiin 1 (non-muscle) 


7.5 


•115967 


AA446887 


Hs.42911 


ESTs 


8.8 


115984 


AA447687 


HS.911D9 


ESTs 


13.1 


116009 


AA449448 


Hs.44238 


ESTs 


5.5 


116024 


AA451748 


Hs.83883 


Human ONA sequence from done 718J7 


7.5 


116028 


AA452112 


Hs.42644 


thioredoxffhiike 


12.7 


116050 


AA453656 


Hs.88417 


ESTs 


7.2 


116097 


AA456099 


Hs.176376 


ESTs 


11.8 


116108 


AA457566 


Hs^8777 


ESTs 


4.5 


1.16121 


AA459254 


Hs.48855 


ESTs 


4.5 


116127 


M4597a3 


Hs.279884 


vwnyc avian myelocytomatosis viral onoo 


4.3 


116129 


AA459956 


Hs.49163 


ESTs; Highly similar to putative ribonude 


7.6 


116142 


AA460649 


Hs.39457 


ESTs 


4.8 


116204 


AA465701 


Hs.108646 


ESTs . 


6.8 


116221 


AA478397 


Hs.50180 


ESTs 


4.9 


116222 


AA478415 


Hs.89986 


ESTs 


4 


116238 


AA479362 


Hs.47144 


OKFZP586N0819 protein 


4.6 


116246 


AA479961 


Hs.250646 


ESTs; Highly similar to ubiquitirHXMiiuga 


4 


116249 


AA480886 


Hs.86693 


ESTs 


18.5 


116250 


AA480975 


Hs.44829 


ESTs 


10.8 


116254 


AA481146 


Hs.41086 


ESTs; Weakly similar to OXYSTEROL-B 


9.1 


116256 


AA481256 


Hs.88201 


ESTs; Weakly similar to lyso phosphofipa 


8.4 


116264 


AA482594 


Hs.272239 


Homo sapiens mRNA; cDNA DKFZp586 


7.2 


116265 


AA482595 


Hs.55189 


ESTs; Weakly similar to F25B5.3 [C.eleg 


11.1 


116282 


AA486550 


Hs.204501 


ESTs; Weakly similar to Wiskott-Aldrich 


6.2 


116298 


AA489046 


Hs,94109 


ESTs 


4.9 


116300 


AA489194 


Hs.159471 


ESTs; Weakly similar to snRNP protein B 


4.6 


116327 


AA490959 


Hs.28005 


Homo sapiens mRNA; cDNA DW=Zp564 


5.8 


116334 


AA491457 


Hs.48g48 


ESTs 


4.3 


116337 


AA496127 


Hs.44070 


ESTs 


8.4 


116351 


AA504116 


Hs.82501 


• Homo sapiens mRNA; cONA DKFZp434 


5.3 


116357 


AA504806 


Hs.90797 


Homo sapiens clone 23620 mRNA sequen 


5.2 


116415 


AA609204 


Hs.27973 


K1AA0874 protein 


6.6 


116443 


AA620313 


Hs.190488 


ESTs; Weakly similar to KERATIN; TYP 


4.5 


116470 


C13992 


Hs.83464 


ESTs 


4.5 


116480 


C14088 




glyceraldehyde-d-phosphate dehydiogena 


5.6 


116578 


D51272 


Hs.75337 


nucleolar phosphoprotein pi 30 


4.1 


116579 


D51276 


Hs.81915 


leukemia-assodated phosphoprotein p18 


5.8 


116626 


F02028 


Hs.81907 


ESTs 


4.9 


116647 


F03069 


Hs.15395 


ESTs; Weakly similar to ARGINYL-TRN 


6.1 


116674 


F04816 


Hs.92127 


ESTs 


10.6 


116680 


F08813 


Hs,273829 


LINE retrotransposable element 1 


4.2 


116700 


F09983 


Hs.317589 


ESTs 


13 


116724 


F13665 


Hs.65641 


ESTs 


8.5 


116726 


F13681 


Hs.53913 


ESTs 


5.6 


116732 


F13779 


Hs.165909 


ESTs 


.11.6 


116734 


F13789 


Hs.93796 


DKFZP585D2223 protein 


5.4 


116760 


H11054 


Hs.155342 


protein kinase C; delta 


4.3 


116780 


H22566 


Hs.30098 


ESTs 


5.7 


116786 


H25836 


Hs.301527 


tumor necrosis factor (Hgand) superfamily 


8.8 


116787 


H28581 


Hs.15641 


ESTs 


8.6 


116790 


H29532 


Hs.101174 


mtcrotubule-assodated protein tau 


22,2 


116803 


H47357 


Hs.109701 


ESTs; Moderately similar to weak similar 


6.7 


116877 


H68116 


Hs.168732 


ESTs 


6.5 


116921 


H72948 


Hs.821 


bigtycan 


20.7 


117216 


N20083 


Hs.42792 


ESTs 


4.4 


117232 


N20579 


Hs.61153 


ESTs 


7.4 


117284 


N22162 


Hs.183779 


ESTs; Weakly similar to cONA EST yk33 


4.1 
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117344 


N24046 . 


Hs.210706 


ESTs 


7.4 


117367 


N24954 


Hs.42502 


ESTs 


10.5 


117392 


N26175 


Hs,93405 


ESTs 


5.8 


117394 


N26257 


Hs.39871 


KIAA0727 protein 


8.4 


117412 


N26722 


Hs.42645 


ESTs 


18.1 


117498 


N31726 


Hs.44268 


ESTs; Highly similar to myelin gene expr 


5.8 


117557 


N33920 


Hs.44532 


diubiquitln 


12.3 


117634 


N36421 


Hs.13323 


ESTs; Weaicly similar to SODIUM- AND 


4.4 


117639 


N35923 


Hs.44833 


ESTs 


6 


117754 


N47469 


Hs.59757 


ESTs 


7.6 


117852 


N49408 


Hs.136102 


KIAA0853 protein 


5.9 


117879 


N50050 


Hs.303025 


ESTs; Weakly similar to keratin; 67K typ 


7.9 


117924 


N51056 


Hs.38891 


ESTs 


7.9 


117950 


N51394 


Hs.75478 


KIAA0956 protein 


5 


117992 


N5200O 


Hs.172089 


Homo sapiens mRNA; cDNA DKFZp586 


7 


118138 


N57773 


Hs.93560 


ESTs; WeaWy simflar to trg [R,noweglcu 


4.8 


118215 


N62195 


Hs.77910 


3-hydroxy-3-methylglutaiyl-Ccenzyme A 


13.4 


118229 


N62339 


Hs.166254 


heat shock 90kD protein 1 ; alpha 


5.4 


118265 


N62827 


Hs.48645 


EST 


4.2 


118336 


N63604 


Hs.47166 


ESTs 


7.2 


118363 


N64168 


Hs.48938 


ESTs 


6 


118429 


N66158 


Hs.74649 


ESTs 


4.1 


118470 


N66769 


Hs.291033 


ESTs 


5.4 


118472 


N66818 


Hs.42179 


ESTs 


10.8 


118475 


N55845 




ESTs; Weakly similar to )!!! ALU CUSS 


4.5 


118493 


N67149 


Hs.50115 


ESTs 


5.3 


118528 


N67889 


Hs-49397 


ESTs 


10.4 


118542 


N68010 


Hs.49427 


ESTs 


7.9 


118600 


N69222 




ESTs 


9,2 


118695 


N71781 


Hs.50081 


Homo sapiens mRNA fijll teng&i insert cD 


9.8 


118698 


N72113 


Hs.50187 


ESTs 


4.3 


118901 


N90719 


Hs.94445 


ESTs 


8.1 


118952 


N92966 




ESTs; Highly similar to CGI-90 protein (H 


12.5 


118976 


N93629 


Hs.93391 


ESTs 


5 


118986 


N94362 


Hs.125830 


ESTs 


7.3 


118989 


N94439 


Hs.45105 


ESTs 


8.2 


119027 


N99256 


Hs.114611 


ESTs 


5 


119042 


R05316 


Hs.5472 


ESTs 


4 


119075 


R36451 


Hs.287820 


fibronectin 1 


6 


119260 T15916 


Hs.102950 


ESTs; Highly similar to coat protein gamm 


4.1 


119271 


T16387 


Hs.65328 


ESTs 


12.1 


119298 T23820 


Hs.155478 


cyclin T2 


5.6 


119302 T25725 




ESTs 


143 


119341 


T62571 


Hs. 146388 


micnotubule-assodated p/olein 7 


4 


119495 


W35390 


Ks.55533 


ESTs 


5.3 


119580 


W42451 


Hs.92260 


high-mobility group protein 2-(ike 1 


5.6 


119602 


W46286 


Hs.233694 


ESTs; Weakly similar to ZK1058.5 [C.ele 


6.5 


119620 


W47620 


Hs.56009 


Z-S'oligoadenylate synthetase 3 


8.1 


119576 


W60473 


Hs.57787 


ESTs 


5.5 


119717 


W69134 


Hs.57987 


ESTs 


4.6 


119729 


W69747 


Hs.94806 


K1AA1062 protein 


4 


119805 


W73788 


Hs.43213 


ESTs 


4 


119859 


W80702 


Hs.58461 


ESTs 


4.8 


119867 


W80852 


Hs.260696 


KDEL (Lys-Asp-Glu-Leu) endoplasmic re 


4.2 


119873 


W81129 


Hs.44865 


Homo sapiens mRNA; cDNA DKFZp586 


4.8 


119899 


W84767 


Hs.58698 


ESTs 


5.9 


119940 


W86779 


Hs.272531 


DKFZP586B0319 protein 


9 


119943 


W86B35 


Hs.14158 


copine 111 


4.8 


119970 


W87812 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp586 


4 


120131 


Z38656 


Hs.75887 


coatomer protein complex; subunit alpha 


4.2 


120150 


Z39549 


Hs.153746 


ESTs 


11 


120206 


Z40805 


Hs.91668 


ESTs 


8.2 


120241 


Z41815 


Hs.65946 


ESTs 


15.6 


120255 AA169752 Hs^672 


ESTs; Weakly similar to Similarity to Yea 


4.2 


120314 AA194166 Hs,221040 ■ 


KIAA1038 protein 


6.8 


120325 AA195651 


1 Hs.104106 


ESTs 


15.2 


120352 AA211400 Hs.193172 


ESTs 


6.8 
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120428 AA236822 Hs.173694 


KIAA1097 DiDtein 


5.6 


120524 AA261852 Hs.192905 


ESTs 


5.6 


120528 AA267107 


Uc 10441') 


ESTs 


4.5 


120^71 AA2Rn71fi 




ESTs 


4.9 


I^UOH? MM^Or 1 


Me 1Q9AA0 


ESTs 


4.5 




Mc 9^R9n*; 

rlb.ZOOZUO 


ESTs 


6.7 


l^nfifift AA9R7M? 




CO 1 a 


8.3 


1 9071 9 A A9Q9f\<i^ 


ns. 1 UZ3UO 


oiflran/nlin frnnclflltnn inlflsitirtn fArtnr 9 al 


4.6 


^o^\7^^ AA9Q9fi*t'i 


Me QR<i(i7 


COI d 


10.6 


1 9n794 A &9Q'^ ATfl 


Me 1fin7^7 


CO 1 9 


5.4 






CO t 


7,1 


1 Zuood AAOOOO 1 0 


nS.oUlo 72 


PQTc- Mnriorafehf eimtlar in ftff At 1 1 fS( 1 
CO 1 b, IVlUucfalBiy oUIIlKli Ul 111! nlAJ OU 


4.6 




nS.oUl 


CO 1 b 


8.2 


■nnO^IQ AA^Q^QOO 

JzU^o AAJy/ozz 


LJc 4nACicn 


CO i s, niyniy suniJar lo simuar vo nta^o n 






nS. 129200 


Casein Kinase i ^ gaiiuna o 




120977 AAo9o155 


ns.97ou0 


CQTe 
CO IS 




40-4 in*) AAQOOOQC 

iziiUj AAJboyob 


nS.9/t3y7 


CCT 

col 


7 A 


12ii29l AA4U1753 


Hs.SloD 


(unQ C3nc8r C3ndid3t6 




121320 AA4 03008 


!_!#» 004 007 

Hs.301y27 


i -Cell IcCcpUJI, oipiia \V,U|J|V.»j 




121463 AA4i1745 


I >_ O0ACO4 

nS.239Do1 


CCTe* \AlaaMu similar I^IAAACI^ nmfo 


A Q 


121o9d AA4io740 


nS-174104 


CO IS 


00 ft 


121723 AA4 19622 


HS. 104800 


CO 1 s, wcaKiy stmiiario iviouse mrviN 


D 
O 


121748 AA42ll71 


nS.234 545 


colS 


ft 


1221/3 AA4o44n 


nS.yooOb 


CO IS 


- 9.0 


122.S21 Aft449444 


nS.9a9o9 


CO IS 


A 
H 


122695 AA454750 


Um 07Q07 

nS.97o37 


Co IS 


4 


122704 AA4aDd2D 


LI— OOAAH 

nS.3y445 


CQTc 
CO IS 


R 0 


122762 AA459o94 


nS.99472 


CCTe 
CO IS 




122o3D AA4q^74U 


7COe7 

nS.75ob7 


Crv* Jllr B_9 H 9 nto r 


13.1 


4 00000 AAilCC004 

1 22882 AA4o5oo1 


4n0040 

nS.lUool2 


Co 1 S» VVcaKiy SlflUlar ttJ DUVrl 1 .0 [y.cictj 




4 OOOOO A A il7CC70 

122928 /\A47oo7o 


nS. 1Ulo40 


CO lo 


6.3 


4 0007>l AAA70C0C 

1229/4 AA4fob2b 


HS. 194215 


COIS 




4 00QQ7 AA>17GOQC 

1 2299/ AA4 f 929o 


nS.1Ub29U 


l^alr*W mAfif /VMifointnA Arr^totn 
ITIQUI uUlllallUilJj pJUlctn 


12.5 


123016 /\A4o0103 


ns.323231 


CO 1 9, VVcdluy bililllGll 10 dlwITlaUVciy apii 


4.4 


123107 AA4od071 


nS.1U4207 


CO 1 a 


8.3 


123111 AA485273 


HS.191721 


COI S 




12on4 AA4od4U/ 


nS. 1 29928 


PQTc- K^ArloratAlu eimtlar In WIAAflA*^ n 
CO J 0, ArJOUciaieiy SlinHar lU f\IMr\trTO»f p 




123136 AA487449 


HS. 134024 


cot S 


A 9 


123137 M487468 


nS.1uuoo6 


cold, vvcaiuy olllUidl lU octclclcU \«clllclil 


14.6 


123169 AA468892 




Co 1 5, weaKiy sirniiar lo oag-roi poiypru 


•f.3 


123176 AA489020 


L!^ CAOOO 

ns.69233 


CQTc 

col s 


9 


400000 AACn/IOAn 

123338 AAo04249 


11- 4 D7CQC 

nS.1o75o5 


PQTc 

col S 


A 


123436 AA598714 


nS.223014 


protsassj sGrinei 15 




123442 AA598803 


1 1 _ AAA AfMy 

Hs.1 11496 


PQTc 

CO 1 o 




123449 AA598899 


■ 1^ 4 4 04 AO 

HS. 112493 


norno sapjens fiira»M« curin Unr^fK)W 


A 1 


123494 AA599786 


1 l_ 4 4 04 4 A 

Hs.112110 


CQTc 
CO IS 


A 


123503 AA600121 


Hs.293156 


CQTc 
CO IS 


i9 ft 


123533 AA608751 




PQTc* VA/oak^iu cimitar 1111 Al 1 1 CI tDPA 

CO 1 St weawy similar lo !!!! alu ouorA 


7 Q 


123619 AA6092Q0 




CCTe 
CO IS 




123673 AA6G9471 


4eoeAn 

Hs.1 58549 


FCTc 
CO IS 




1 9^700 A ARnQ77ft 


We 97RR79 


luemurwie oomponeni^ cmomosome ii, s 


A 7 


123819 AA620636 


Hs.1 12264 


FCTc 
CO IS 




123960 AA621785 


Hs.287733 


fnefhvfmafanafe-semialdehvdfi dehvdioae 


7.6 


.124000 D57317 


HS74861 


activated RNA polymefase II transcriptioh 


4.4 


124006 D60302 


Hs.270016 


ESTs 


20.6 


124012 D80240 


Hs.241471 


HUMSG1 1 A Human fetal brain (TFujiwa 


6.7 


124021 F02859 


Hs.1 3974 


ESTs 


4.7 


124049 F10523 


Hs.74519 


primase; polypeptide 2A (58kD) 


4.7 


124059 F13673 


Hs.283713 


ESTs 


7.7 


124243 H66710 


Hs.133525 


ESTs 


5.5 


124308. H93575 


Hs.241507 


Homo sapiens mRNA; cDNA DKFZp564 


11.4 


124314 H94877 . 


Hs.215766 


OTP-binding protein 


13.7 


124315 H94892 


Hs.288757 


v-ral simian leukemia viral oncogene hom 


14 


124350 N21359 


Hs.101282 


Homo sapiens mRNA; cDNA OKFZp434 


8.6 


124352 N21626 


Hs.102406 


ESTs 


7.2 


124357 N22401 




yw37g07^1 Morton Fetal Cochlea Homo 


5.2 


124390 N29325 


Hs.7535 


ESTs; Highly sm\ar to COBW-fite place 


7.9 
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124438 


N40188 


Hs.11090 


ESTs 


9.5 


124447 


N48000 




Homo sapiens mRNA; cDNA DKFZp586 


4.8 


124457 


N50114 


Hs.266175 


ESTs 


6.1 


124539 


N63172 


Hs.146409 


cell division cyde 42 (GTP-binding prate 


5.6 


124626 


N74604 


Hs.11090 


ESTs 


12.8 


124632 


N79515 


Hs.306117 


interteukin 13 receptor; alpha 1 


6.4 


124644 


N91279 


Hs.109654 


ESTs; Moderately similar to outer membr 


8.3 


124676 


R01037 


Hs.181013 


ptiosphogiycerate mutase 1 (brain) 


12.3 


124677 


R01073 




ESTs; Weakly similar to !!!! ALU CLASS 


5.4 


124724 


R12405 


Hs.112423 


Homo sapiens mRNA; cONA DKFZp586 


6.6 


124773 


R40923 


Hs.106604 


ESTs 


4.9 


124777 


R41933 




ESTs 


7.2 


124792 


R44357 


Hs.48712 


ESTs; Weakly simDar to cDNA EST EM6 


8.6 


124857 


R63652 


Hs.137190 


ESTs 


4.9 


124911 


R88992 


Hs.180612 


ESTs 


4.7 


124955 


T10598 


Hs.324841 


ESTs; Weakly similar to HI! ALU SUBFA 


4.4 


124958 


T11134 


Hs.431 


murine leukemia viral (bmt-l) oncogene h 


12.6 


125038 


T78089 


Hs.270134 


ESTs 


4.1 


125092 


T92544 


Hs.137548 


CD84 ant'gen (leukocyte antigen) 


14.8 


125132 


W15495 


Hs.129781 


chromosome 21 open reading frame 5 


6.7 


125144 


W37999 


Hs.24336 


ESTs 


4.8 


125154 


W38419 




ESTs 


5.3 , 


125243 


W86423 


Hs.105413 


ESTs 


6.6 


125279 


W93640 


Hs.4779 


ESTs; Moderately similar to similar to AD 


5.8 


125299 


Z39436 


Hs.102720 


ESTs 


12.2 


125303 


Z39821 


Hs.288193 


ESTs 


10.2 


125304 


Z39833 


Hs.124940 


GTP-binding protein 


6.8 


125474 


AA151216 


Hs.75103 


tyrosine 3-monooxygenase/tryptophan 5-m 


8 


125509 


M044232 


HS.28B967 


ESTs 


5.4 


125580 


AA1 26504 


Hs.267812 


sorting nexin 4 


4.1 


125582 


AA507383 


Hs.74649 


cytochrome c oxidase subunit Vic 


11.5 


.125670 


AI432621 


Hs.82685 


CD47 antigen (Rh-re!ated antigen; integri 


4 


125698 


AA748483 


Hs.191356 


general transcription factor IIH; polypepti 


9.4 


125745 


AI283493 


Hs.75722 


ribophorin 11 . 


6.2 


125852 


H09290 


Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564 


25.9 


125972 


AA434562 


Hs.35406 


ESTs 


4,1 


126160 


N90960 


Hs.265398 


ESTs; Weakly similar to transfonnatlon-r 


16,4 


126257. N99636 


Hs.124084 


tumor necrosis factor receptor superfamily 


9.5 


126337 


AI066486 


Hs.40500 


similar to S. cerevislae RER1 


5.6 


126405 


U46278 


Hs.122489 


ESTs 


7,5 


126537 


W40262 


Hs.146310 


ESTs; Weakly similar to putative p150 [H 


4.1 


126590 W78968 


Hs.181307 


H3 histone; family 3A 


4.5 


126712 


AA205862 Hs.7942 


ESTs 


5.2 


126721 


T72569 


Hs.125359 


Thy-1 cell surface antigen 


4.4 


126764 


AI334393 


Hs.102178 


ESTs 


4.6 


126804 


AI203334 


Hs.160628 


ESTs 


11.7 


126819 AA305536 


Hs.279607 


ESTs 


4 


126877 


AI052047 


Hs.26102 


ESTs 


7 


126991 


R31652 


Hs.821 


bigtycan 


5.6 


127479 


AA513722 


Hs.179729 


collagen; type X; alpha 1 (Schmid meiaph 


14.3 


127514 


AA826926 


Hs,204214 


ESTs 


4.5 


127663 


W07286 


Hs.10340 


ESTs; Weakly similar to weak similarity t 


5.1 


127677 


AA916752 


Hs.264190 


ESTs; Highly similar to MEM3 [M.muscu 


17.3 


127814 


AA761755 


•Hs.1 36713 


ESTs; Weakly similar to V4-1 (H.sapiens 


4.1 


127997 


AI281549 


Hs.311054 


ESTs 


5.5 


' 128092 


AA904617 


Hs.166229 


ESTs 


5.8 


128218 


H02682 


Hs.292154 


ESTs; Moderately similar to recombinatio 


5.8 


128466 


D59653 


Hs.241471 


EST 


7.4 


128482 


U83908 


Hs.296251 


programmed cell death 4 


5.8 


128517 


AA280617 


Hs.100861 


ESTs; Weakly similar to p60 kalanin [H.s 


8.3 


128530 


AA504343 


Hs.183475 


Homo sapiens clone 25061 mRNA sequen 


6.8 


128559 


AA226801 


Hs.101448 


metastasis associated 1 


5.2 


128574 


AA412048 


Hs.38260 


keratin 8 


5.1 


128595 


U31875 


Hs.152677 


short-chain alcohol dehydrogenase family 


27.1 


128610 L38608 


Hs.10247 


activated leucocyte cell adhesion molecule 


13,2 


128629 


AA399187 


Hs.102708 


OKFZP434A043 protein 


6.7 


128649 AA142853 


Hs.103106 


Homo sapiens mRNA forG7b protein (G 


4.5 
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128651 


AA446990 


Hs.103135 


ESTs 


6.1 


128653 


R48943 


Hs.10315 


sotute carrier family 7 (cationic amino aci 


4.4 


128656 


AA458542 


Hs.10326 


coatomer protein complex; subunit epsilon 


14.3 


128717 


T30617 


Hs.104222 


Homo sapiens mRNA; cDNA DKFZp566 


24.5 


128727 


M64174 


Hs.50651 


Janus kinase 1 (a protein tyrosine kinase) 


7.3 


128764 


N49308 


Hs.104938 


ESTs; Weakly similar to alpha 1 (XVIII) c 


9.2 


128781 


X85372 


Hs.105465 


small nuclear n'bonudeoproteln polypepti 


5.4 


128793 


W93562 


Hs. 105749 


KIAA0553 protein 


4.6 


128835 


W15528 


Hs. 106390 


Homo sapiens mRNA; cDNA DKFZp586 


4 


126845 


AA455658 


Hs,10649 


basement membrane^induced gene 


6.9 


128871 


AA4a0271 


Hs.1 06778 


Homo sapiens mRNA for putative Ca2+4 


4.5 


128922 


AA252023 


Hs.9589 


ESTs; Weakly similar to HRIHFB2157 [H 


6.4 


128925 


D61676 


Hs.21851 


Homo sapiens mRNA; cDNA bKFZp586 


6.4 


128938 


AA4 10325 


Hs.1 07260 


ESTs 


7 


128946 


N29353 


Hs.107318 


kynurenine 3-monooxygenase (kynurenin 


5.2 


128948 


AA485655 


Hs.223025 


proteasome (prosome; macropain) subunit 


13.1 


128955 


F10290 


Hs.185807 


Homo sapiens clone 24758 mRNA sequen 


5,8 


129005 


AA460049 


Hs.1 3323 


ESTs; Weakly similar to SODIUM- AND 


12.6 


129009 


AA131421 


Hs!75607 


ESTs 


9.8 


129017 


H 13108 


Hs 107968 

1 Iv* Iwf wUW 


ESTs 


13.9 


129057 


X62466 


Hs.276770 


CDW52 antgen (CAMPATH-1 antigen) 


107 


129075 


AA129465 


Hs.83765 


ESTs 


4.7 


129095 


L12350 


Hs.108623 


thrombospondin 2 


4.4 


129124 


AA234530 


Hs.1 08802 


N*€thylm2deimide-sensiSvG factor 


20.7 


129160 


AA131252 


Hs 109007 


ESTs 


5.9 


129164 


AA282183 


Hs 109045 


ESTs. 


5.8 


129180 


R40556 


Hs.318401 


ESTs" Hiahlv similar to HSPC039 orotein 


7.6 


129224 


X89109 


Hs 109606 

1 lw« 1 V^wvVI 


nnrnnin* arfln-hinrflnn nrotfiln' 1A 


12 


129229 


AA211941 


Hs 109643 


nnlvAHpnvlatp hinrlinn nmipnvJnfPRirtinfi 


7.9 


129240 


W24360 


Hs.237868 


Interieukin 7 receptor 


5.3 


129241 


AA435665 


Hs.1 09706 


ESTs* Moderafptv similar to HNl fM mim 


8.4 


129243 


H88033 


Hs. 109727 


KIAA0733 DfOtein 


7.8 


129247 


AA151574 


Hs. 109733 


Dilin-iike transcriotion factor 


6.4 


1 29259 


AA090695 


Hs 181385 


ESTs 


6,2 


129270 


Z35227 


Hs 109918 


ra<; hnmnlnn npnp famllv mpmhpr H 


54 


129281 


AA026318 


Hs 289101 


nliim«;p rpniilrifpri nmtpin* 'iRkfl 


4.4 


129300 


C20976 


Hs 110165 


PRTs' Hinhlv similar tn rihn<5nnnpl nnnfpin 


5.7 


129318 


N93155 


Hs 285976 


f^fmnffMli'fi 1 fnhfi?nhnn/la<5P Iffna^A' rfpffp 


7.7 


129319 


AA037467 


Hs 30340 


ESTs 


Q 


129351 


AA1 67268 


Hs 62349 


Human ras inhibitor mRNA* 3' end 


9.3 


129366 


HI 8027 


Hs.1 84697 


plexin CI 


18.2 


129383 


W92984 


Hs,288224 


ESTs 


5.9 


129388 


AA151621 


Hs 110964 


ESTs 


4:1 


129381 


T80814 


Hs.11101 


discs; large (Drosophlla) homolog 3 (neur 


10,9 


129404 


AA172056 


Hs.317584 


ESTs 


5.3 


129406 


N23707 


Hs.1 11138 


K1AA0712 aenp nioduci 


4 


129426 


AA412087 


Hs,111323 


EST; Highly similar to protein inhibitor o 


8 


129453 


AA421213 


Hs,1 11632 


l_sm3 protesfi 


5.5 


129513 


C00225 


Hs.305163 


ESTs' Weaklv similar to fos39554 1 fH s 


5.5 


129519 


AA298786 


Hs.1 12242 


ESTs 


6*8 


129606 


R21443 


Hs.1 66254 


heatshod^ 90kD protein 1; alpha 


5 


129622 


AA278243 


Hs.323949 


ESTs 


6.8 


129626 


AA447410 


Hs,111334 


ESTs; Weakly similar to !!H ALU SUBFA 


5.1 


129627 


AA258308 


Hs.71968 


Homo sapiens mRNA; cDNA DKFZp564 


5.3 


129628 


U26727 


Hs.1 174 


cyclin-dependent kinase inhibitor 2A (met 


8.2 


129642 


R50008 


Hs.1 1806 


7-dehydrocho!estero! reductase 


4.3 


129663 


AA442768 


Hs.1 1866 


translocase of inner mitochondrial membr 


4.4 


129665 


M88458 


Hs.1 18778 


KDEL (Lys-Asp-GIu-Leu) endoplasmic re 


4 


129691 


X06700 


Hs.1 19571 


collagen; type ill; alpha 1 (Ehlets-Danlos 


6 


129783 


AA454618 


Hs.12479 


associated molecule with the SH3 donnain 


6.4 


129800 


AA252436 


Hs.12540 


lysophospholipase 1 


7.7 


129836 


AA452161 


Hs.206521 


YME1 (acerevisiae)-nke 1 


5 


129850 


N20593 


Hs.288932 


GDP dissociation inhibitor 2 


5.9 


129869 


AA102520 


Hs.13015 


ESTs; Weakly similar to heat shock prote 


5 


129896 


AA043021 


Hs.13225 


UDP-GaltetaGtcNAc beta 1;4- galactosy 


6.6 


129982 


M87789 




immunogtobufin gamma 3 (Gm marker) 


4 


129965 


AA45004S 


Hs.140452 


cargo selection protein (mannose 6 phosp 


5.8 
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130029 AA236412 
130033 M90696 
130036 AA195260 
130069 AA055896 
130077 T24055 
130080 X14850 
130096 AA223874 
130114 AA234717 
130125 M36803 
130135 M61764 
130170 AA610070 
130189 D43947 
130208 AA620556 
130211 D50840 
130235 X14046 
130276 S75295 
130280 L13738 

130313 AA620323 

130314 D86967 
130328 AA135673 
130356 X84373 
130367 Z38501 
130378 T47333 
130384 X66364 
130393 013630 
130399 AA449417 
130407 N29888 
130414 M21121 
130417 U58522 
130421 D21260 
130441 U35835 
130455 X17059 

130498 L38951 

130499 AA416723 
130511 L32137 
130553 AA430032 
130558 Hg6654 
130568 AA232535 
130583 W24957 
130585 H66211 
130804 X03635 
130614 AA132007 
130619 M477739 
130622 AA235247 
130625 F03969 
130627 L23808 
130629 M60346 
130635 M87503 
130639 D59711 
130677 H17861 
130681 D82808 
130693 AA487202 
130703 N63295 
130706 AA488843 
130712 AA292066 

130714 X92896 

130715 T98227 
130744 AA203527 
130747 AA471293 
130751 AA435633 
130796 R39390 
130800 AA223386 
130855 AA425439 
130859 AA287327 
130866 M58028 
130880 D14678 
130891 031891 



Hs.236510 

Hs.181301 

Hs.125849 

Hs.146428 

Ms.91379 

Hs.147097 

Hs.197955 

H&14992 

Hs.1504 

Hs.21635 

Hs.151469 

Hs.151761 

Hs.15250 

Hs^3703 

Hs.1 53053 

Hs.169149 

Hs, 153937 

Hs.154320 

Hs.154332 

Hs.l 54668 

Hs.155017 

Hs.8768 

Hs.1 55188 

Hs.1 66071 

Hs.155291 

Hs.155356 

Hs.l 55410 

Hs.241392 

Hs.155485 

Hs.178710 

Hs.155637 

Hs.155956 

Hs.180446 

Hs.158286 

Hs.1584 

Hs.252587 

Hs.15984 

Hs.16085 

Hs.293907 

Hs.16331 

Hs.1657 

Hs.16697 

Hs.12532 

Hs.16846 

Hs.260720 

Hs.1695 

Hs.1697 

Hs.1706 

Hs.17132 

Hs.17767 

Hs.17820 

Hs.17962 

Hs,15103 

Hs.201673 

Hs.279762 

Hs.18212 

Hs,171952 

Hs.18747 

Hs.6879 

Hs.18879 

Hs.19525 

Hs.19574 

Hs.143323 

Hs.20478 

Hs.2055 

Hs.20830 

Hs.20991 



ESTs; Modenately similar to PFT27 (M.m 
cathepstn S 

ESTs: Moderately similar to ALU SU 

collagen; type V; alplia 1 

ribosomal protein L26 

H2A htstone family, member X 

KtAA0704 protein 

ESTs 

hemopexln 

tubulin; gamma 1 

calcium/calmodulin-dependent serine pro 
K1AA0100 gene product 
peroxisomal D3;D2-enoyl<CoA isomerase 
UDP^lucose ceramide glucosyltransferas 
CD37'antigen 

karyopherin alpha 1 (importin alpha 5) 

activated p21odc42Hs kinase 

ubiquitin-activating enzyme E1C (homoio 

KlAA0212gene product 

KiAA0391 gene product 

nuclear receptor interacting protein 1 

ESTs; Weakly similar to ?(!! ALU SUBFA 

TATA box binding protein (TBP)-associa 

cyclin-dependent kinase 5 

K(AA0005 gene product 

Homo sapiens mRNA for putative glucosy 

ESTs 

small inducible cytokine A5 (RANTES) 
huntingtin-interacting protein 2 
clathrin; heavy potypeptlde-like 2 
protein kinase; DNA-activated; catalytic p 
N-acetyltransferase 1 (arylamine N-acetyl 
karyopherin (importin) beta 1 
Homo sapiens mRNA for KIAA0446 prot 
cartilage ofigomeric matrix protein (pseud 
pituitary tumor-transforming 1 
ESTs; Weakly sMar to gene pp21 protei 
ESTs; H^hly similar to CGI-13 protein (H 
ESTs; Moderately similar to simitar to C.e 
ESTs 

estrogen receptor 1 

ESTs 

ESTs 

ESTs; Weakly similar to cytochrome P45 
matrix metaRoproteinase 2 (geiatinase A; 
matrix metalloproteinase 12 (macrophage 
ATPase; H+ transporting; lysosomal (vacu 
interferon-stimulated transcription factor 
ESTs 
ESTs 

Rho-assQctated; coHed-coil containing pro 

ESTs 

ESTs 

comichon-like 
adenylate cyclase 7 

DMA segment on chromosome X (unique) 
occludin 

P0P7 (processing of precursor, S. cere vis 
ESTs 

Homo sapiens done 23965 mRNA sequen 
ESTs 

ESTs'r Weakly similar to katanin p80 subu ' 
putative DN>Ochromatin binding motif 
ceroid4ipofuscinosis; neuronal 2; late tnfa 
ubiquitin-activating enzyme El (A1S9T a 
kinesin-llke 2 
SET domain; bifurcated; 1 
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7.4 
7.6 
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5 

7.8 
7.2 
5.6 
7.5 
6.4 
6.4 
4.5 
9.1 
8.6 
5 

6.1 

10 

6.1 

10.6 

8.3 

7.1 

5.6 

4.1 

4.6 

7 

4.1 
7.9 
4 

6.8 

26.4 

4.8 

6.1 

8.3 

7.5 

5.6 . 

4 

13.3 

10.1 

39.9 

5.1 

5.9 

4.1 

8.3 

10.3 

7 

5.5 
7.2 . 
13.5 
6 

6.1 
4.3 
4 

5.1 
8.4 
5.7 
6.2 
8.2 
8.3 
4.5 
7.7 
4.3 
9.8 
4.3 
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130905 


AA056489 


Hs.129998 


ESTs 


8.7 


130913 


W03592 


Hs.21198 


translocase of outer mitochondrial membr 


20^ 


130919 


AA291710 


Hs.21276 


collagen; type (V; a!pha 3 (Goodpasture a 


9 


130921 


AA074596 


Hs.194688 


bromodomain adjacent to zinc finger dom 


5.3 


130944 


M97935 


Hs.21486 


signal transducer and activator of Iranscrip 


18.8 


130974 


X57985 


Hs.2178 


H2B histone family: member Q 


13.4 


130987 


R45698 


Hs.21893 


ESTs; Weakly similar to cAMP inducible 


8.5 


130999 


N48963 


Hs.21992 


KIAA0689 protein 


7.2 


131010 


AA435748 


Hs.169341 


ESTs; Weakly similar to phosphatidic add 


5.2 


131046 


X02530 


Hs.2248 


small inducible cytokine subfamily B (Cy 


10.1 


131091 


T35341 


Hs.22880 


ESTs; Highly similar to dipeptidyt peptid 


6.3 


131153 


H11760 


Hs.23606 


ESTs 


7.3 


131185 


M25753 


Hs.23960 


cydin 81 


6.2 


131200 


AA609427 


Hs.293732 


ESTs; Moderately similar to !!!! ALU SU 


4.3 


131206 


AA044078 


Hs^4210 


ESTs 


5.5 


131210 


AA430047 


Hs.95549 


ESTs 


7.1 


131227 


AA429472 


Hs.236522 


DKFZP434P106 protein 


5.6 


131244 


038076 


Hs.24763 


RAN binding protein 1 


5.5 


131245 


AA620599 


Hs.24766 


DKFZP564E1962 protein 


6.7 


131257 


AA256042 


Hs,24908 


ESTs 


5.8 


131319 


U25997 


Hs.25590 


stanntocaidn 


8.9 


131339 


'AA463450 


Hs.25812 


Nijmegen breakage syndrome 1 (nibrin) 


6.5 


131368 


R34531 


Hs.92200 


KIAA0480gene product 


9.2 


131410 


H84658 


Hs.279836 


ESTs 


12.1 


131472 


AA608962 


Hs.27258 


calcydin binding protein 


18.1 


131475 


Z39053 


Hs.27263 


ESTs 


7.5 


131501 


AA121127 


Hs.8207 


H3 histone; family 3A 


5.5 


131514 


X02152 


Hs.2795 


lactate dehydrogenase A 


5.1 


131524 


N39152 


Hs.301804 


ESTs 


4.3 


131528 


D60856 


Hs.28309 


UDP-glucose dehydrogenase 


8.4 


131544 


N33236 


Hs.28555 


ESTs; Weakly similar to 80511.8 [C.eleg 


5.6 


131557 


D30946 


Hs.28707 


signal sequence receptor; gamma (transloc 


8.7 


131562 


U90551 


Hs.28777 


H2A histone family; member L 


18.8 


131564 


AA491465 


Hs.28792 


ESTs • 


11.8 


131586 


AA235385 


Hs.26966 


ESTs; Moderately similar to alternatively 


4,7 


131587 


M15182 


Hs. 183868 


glucuronidase; beta 


5.2 


131589 


U52100 


Hs.29191 


epithelial membrane protein 2 


4.4 


131615 


D14533 


Hs.192803 


xerodemia pigmentosum; complementatio 


4.6 


131664 


AA136126 


Hs.30327 


mitogen-activated protein kinase-activated 


4.3 


131679 


AA138660 


Hs.30579 


ESTs 


9.4 


131684 


U26174 


Hs.3066 


granzyme K (serine protease; granzyme 3 


9.7 


131687 


LI 1066 


Hs.3069 


heat shock 70kO protein 98 (mortalin-2) • 


6.2 


131689 


AA599653 


Hs.30696 


transcription lador-lfke 5 (basic hefix-loo 


8.3 


131693 


W60913 


Hs.1 10796 


ESTs; Weakly similar to cDN A EST yk45 


9 


131710 


AA233225 


Hs.30985 


MRS1 protein 


5.2 


131716 


D49738 


Hs.31053 


cyfoskeleton-associated protein 1 


6.6 


131742 


D31352 


Hs.31433 


ESTs 


11 


131762 


H46831 


Hs. 107767 


ESTs; Moderately similar to CaM-KII inh 


4.9 


131781 


AA460450 


Hs.31989 


DKFZP586G1722 protein 


9.2 


131795 


N32724 


Hs.32317 


Sox-like transcriptional factor 


4.5 


131809 


L76517 


Hs.3260 


presenilin 1 (Alzheimer disease 3) 


5.4 


131814 


AA437226 


Hs.157 


interteukin 10 receptor; alpha 


4 


131838 


AA091932 


Hs.1 80628 


dynamtn-Iike protein 


6.7 


131877 


J04088 


Hs.1 56346 


topoisomerase (DNA) 11 alpha (1 70kD) 


5 


131885 


AA044095 


Hs.3402 


ESTs 


11.1 


131891 


AA1 58258 


Hs.30376 


heterogeneous nuclear protein similar to r 


5.6 


131925 


AA248470 


Hs.1 83180 


ESTs; Weakly similar to RING finger pro 


4.5 


131930 


AA205460 


Hs.69476 


ESTs 


14.3 


131941 


D62657 


Hs.35086 


ubiquitin-spedfic protease 1 


6.2 


131965 


W90146 


Hs.35962 


ESTs 


6.3 


131970 


D86950 


Hs.3510 


KIAA02b5 gene product 


4.2 


131971 


R70167 


Hs.154938 


ESTs 


4.3 


131974 


AA410424 


Hs.268122 


Homo sapiens mRNA; cDNA DKFZp586 


4.6 


131977 


F09788 


Hs.3622 


procollagen-proline; 2-c»(oglutarate 4-dkDx 


6.4 


131994 


AA479515 


Hs.279882 


Human DNA sequence from done 703H1 


12 


131997 


D82399 


Hs.136644 


Homo sapiens done 23714 nfiRNA sequen 


10 


132017 


W67251 


Hs.267659 


Homo sapiens vav 3 oncogene (VAV3) m 


4.7 
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132021 
132065 
132085 
132089 
132109 
132143 
132149 
132153 
132160 
132164 
132180 
132183 
132225 
132227 
132235 
132256 
132298 
. 132314 
132325 
132384 
132387 
132393 
132406 
132407 
132413 
132446 
132465 
132482 
132492 
132528 
132540 
132543 
132580 
132586 
132608 
132616 
132617 
132618 
132640 
132668 
132694 
132700 
132724 
132738 
132742 
132744 
132795 
132807 
132811 
132817 
132840 
132845 
132847 
132856 
132869 
132874 
132880 
132894 
132900 
132903 
132904 
132906 
132914 
132918 
132936 
132951 
132957 



T68246 
D82226 
D44466 
AA131971 
AA599801 
AA257056 
T10822 
N90141 
AA281770 
U84S73 
AA405569 
L19183 
AA128980 
AA412620 
F09058 
AA608856 
N41849 
AA285290 
N37055 
AA479933 
R70914 
W85888 
F09979 
AA431459 
AA132969 
AA426218 
AA047896 
AA429478 
T03749 
AA283006 
M488987 
AA417152 
L37042 
AA412452 
AA199588 
AA386264 
AA171913 
AA253330 
U33821 
AA453614 
M60830 
N47109 
AA417952 
W42674 
AA490862 
X54326 
H99152 
AA331777 
U25435 
AB004884 
N23817 
D62588 
T48195 
W79865 
N26855 
AA425776 
AA444369 
D82422 
N56451 
AA235404 
X83618 
AA142857 
AA496037 
AA252605 
AB002305 
U04209 
AA234791 



Hs.306079 
Hs.211594 
Hs3887 
Hs^9122 
Hs.40098 
Hs.7972 
Hs.324743 
Ks.41066 
Hs^95923 
Hs.41270 
Hs.418 
Hs.199695 

Hs.4248 
Hs.42656 
Hs.431 
Hs.7120 
Hs.44499 
Hs.44856 
Hs.46967 
Hs.281434 
Hs.47334 
Hs.4774 
Hs.47783 
Hs.260116 
Hs.48764 
Hs.49169 
Hs.238126 
Hs.4990 
Hs.50758 
Hs.5097 
Hs.5101 
Hs.283738 
Hs.52515 
Hs.5321 
Hs.283558 
Hs.5338 
Hs^79916 
Hs.5437 
Hs.5460 
Hs.5509 
Hs.5521 
Hs.55498 
Hs.264636 
Hs^92812 
Hs.55921 
Hs.57079 
Hs.57301 
Hs.57419 
Hs.57553 
Hs,5807 
Hs.5813 
Hs.58189 
Hs.58367 
HS.2Q3961 
Hs.5i3609 
Hs.177537 
Hs,5944 
Hs.5978 
HS.598S 
Hs.59889 
Hs.234896 
Hs.60293 
Hs.6051 
Hs.6111 
Hs.61418 
Hs.61469 



chaperonin containing TCP1; subunit 5 (e 
proteasome (prosome; macfopain) 26S sub 
proteasome (prosome; macropain) 26S sub 
ESTs 
ESTs 

KIAA0871 protein 
ESTs 

ESTs; Moderately similar to ELONGATl 

seven in absentia (Drosophila) homolog 1 

procoilagen-tysine; 2-oxoglutarate 5-dtoxy 

fibroblast activaSon protein; alpiia; sepras 

hypothetical protein 

ESTs 

ESTs 

ESTs 

murine leukemia viral (bmi-1) oncogene h 
Homo sapiens cytokine receptor related p 
sma(IEDRK-rjchfector2 
ESTs 

Human DNA sequence from clone 167A1 

heat shock 70JcD protein 1 

ESTs; Moderately sinularto III! ALU SU 

ESTs 

ESTs 

KIAA1 104 protein 

ESTs 

ESTs 

ESTs; Highly similar to CGI-49 protein (H 
KIAA1089 protein 

chromosome-associated polypeptide C 
synaptogyrin 2 

protein regulator of cytokinesis 1 

casein kinase 1 ; alpha 1 

DKFZP434N024 protein 

ARP3 (adin-reiated protein 3; yeast) horn 

isodtrate dehydrogenase 2 (NADP+); mit 

carbonic anhydrase XII 

adaptor-related protein complex 1; gamma 

Tax1 (human T-cell leukemia vims type I 

KIAA0776 protein 

ecotropic viral integration site 2B 

ESts 

geranylgeranyl diphosphate synthase 1 
ESTs; Moderately similar to neuronal thre 
ESTs; Weakly similar to C43H8.1 [C.eleg 
glutamyl-prolyl-tRNA synthetase 
ESTs 

mutL (E. coli) homolog 1 (colon cancer, n 
transcriptional repressor 
tousled-like kinase 2 

Homo sapiens done 23675 mRNA sequen 
ESTs 

eukaryoGc translation initiation factor 3; s 

glypican 4 

ESTs 

ESTs 

ESTs 

ESTs 

LIM domain only 7 

Homo sapiens done 25186 mRNA sequen 
3-hydroxy-3-methylglutaryl-Coenzyme A 
ESTs; Highly simifar to geminin [H.sapie 
ESTs 

K1AA0616 protein 
KIAA0307 gene product 
microfibrillar-associated protein 1 
Human gene from PAC 753P9; chromoso 
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5.2 

8.5 

13.5 

4.8 

6.2 

14.6 

5.3 

9.2 

5.5 

8.1 

15.4 

12.2 

5.6 

6.7 

6.2 

6 

5.6 

6.8 

4.7 

4.2 

9.1 

4 

15 

8 

4 

5.3 

15.4 

9 

8.5 
4.3 
9.8 
10.1 
5.9 
4.2 
4.2 
5.2 
10.1 
4.8 
5.7 
4.4 
15.6 
7 
.5.6 
4.9 
7.9 
4,1 
8 
8 
4 

6,5 
5.6 
12.4 
7 

6.2 

6.5 

5.6 

7.2 

7.5 

4.4 

9.1 

10.7 

10.2 

4.7 

7.1 

8.3 

4.3 

13.2 
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132959 


AA028103 


Hs.61472 


ESTs; Weatdy similar to unknown [S.cere 


18.9 


132968 


N77151 


Hs.61638 


myosin X 


5.8 


132984 


H80409 


Hs.621 12 


zinc finaer ofDiein 207 


4.3 


132930 


AA458761 


Hs 18387 


transcnDfion factor AP-2 alcha factivatina 


4.2 


132994 


AA505133 


Hs,279905 


solute carrier family 2 (^cilitated glucose 


26.4 


132998 


Y00062 


Hs 170121 


orotein tyrosine ohosohatase* receotor tvD 


4.4 


133002 


AF006082 


Hs 42915 


ARP9 factin-related nintein 2* veast) honri 


4.7 


133005 


C21400 


Hs.278605 


K(AA0970 protein 


6.6 


133015 


AA047036 


Hs 246315 


ESTs 


7.9 


133016 


W81298 


Hs 62B9 


nrnwrfh fartnr rprpntnr4iniinri nmtptn 9 


5.2 


133039 


X62055 


Uc R34ftq 

riO.U«i/H09 


nmtMn Kmneino nhnenhataep* nnn-TPnPntn 


4 


133050 


S67325 


Hs 637R8 


nrnninnvl ("InpnTvmp A f»rhnYv!siSP".hPta 


5.2 


133056 


AA071387 


Hs 6306 


himninn translnf^tinn hrpaknninf 


5 




R33fiG3 




F<5Te 
CO 1 0 


5.4 




N!7nR33 




pVianprnnin /Yinlpininn Tf^PI* ciihiinit 9 /h 
una^clvjillll UJIIIallUiiy I i* oUUUIIIl^ yJ 


5 


133091 


AA199147 
rNM IHf 


Me R4RQ1 






133093 


AA*>0R74Q 




CO 1 o 


5.6 


133124 


AAl'ififUQ 


H<: 96709? 


F<5Te 
CO 1 o 


4.1 


1 JO l£X} 




LJe 


M I rase, xv^ iransporony, lysasomai (viacii 


6 9 




rvOf 00/ 


He R707 


Kos-o i K ose acuvaur^ piDiein ono uoina 


f\ 1 
3. 1 


1 33214 

1 «J«J£ l*T 




Uc 9fiR11R 


inicimutvut lo icwcpiot, atpna i 


6 9 




7414 IS 


Me RA97 


EQTe* lA/oalriu ctmilar in inirineSo istr^nrJPK 

CO 1 i>» vv&aKiy SHiHiof (u inif ui5ic lanut'-D 


A 3 


133900 




Ue AR11 

ns.oooi 


noiTio sapiens Qune i^uu uniuiown piuie 


A 7 






Me 17QftR9 


nomo sapiens cione ^loos iukixa sequen 




433040 


n31 1R1 

1 ID 1 


Me 949RQ4 


Colo 


q 


'I33OC7 


APfinnnRR 

ArUUvVJOu 


Me RRQR 


acun iciottiBU piuicui £io cunipicA« ououiiit 


7 7 




M/791P7 


Me AQ1Q9 


(7CTe* lA/aaUi/ cimtlar fn /«nMA PQT 1/1^^7 

CO 1 s, weoKiy siifinar id cuimm co i yiur 


6 7 
O.r 


133974 






CO la 


4 2 


133781 


AA49in7Q 




CO 1 d« vvcaiuy ouiiiioi ui ouAniius uciiisvii 


4.9 


133583 


AA4 10507 


Hs 6968 


ESTs 


4.3 


133287 


LI 5702 


Hs 60771 




9.3 


133294 


R79723 


Hs 60007 


7inr finnpr nmfpin 93A 


30.4 


1 33907 








104 


133318 


AA25616a 


Hs 152316 


ESTs 


8.5 


1 ooooz 




Hq 7104 


CQTc Minhlu cimtlar trt nmfain IM 

CO 1 o, niyiiiy oiiiHidr lu |jrutcin [n 


14 


133370 


A A1 56897 


Hs 79167 


nKFZP664riQ99 nmtpin 


5 


i 33301 




Hs.727 


inhibin; beta A (acGvin A; adivin AB alp 


13 Q 




AA4019QR 


He 79ftnR 


PQTc 
CO 1 o 


4 3 


133499 


Ki7Q51R 
(Mr 10 


Hs 739ft7 


CO 1 o, vvcaiMy oil filial lu cyciiu [L^.incioiiu 


4 6 


133431 


AA9554'^8 


Hs 7358 


Mnmn QantPnc mRNA* rDNA nk'P7rLR66 


fi 
o 




T93Qft3 


Hs 393066 


PQTs 
CO 1 o 


c 




AAn04QR0 


Hs 73fl1 
no. r 00 1 


vuiu3yc~vicpciiucni diiKJii uidiiiici o 


A 7 
O./ 


133468 




Hs 73931 


mainr histnrnmnafihilrh/ pfimnlpv Mass 11 






Y7fl7in 


Hs 911581 


Miciof'i cyuiatu/y uci(iounput/i 1 fciuu/i i 




1 0«33uD 


AA^IRRRR 
Mrw 1 0000 


Hs 74*746 


CQTc< UV/oalrlu cimilarfn 14nf^H h \T\ ma- 

coiO| vvcaiuy siiiiudf lo IHUV9 1 [u.nic 


0.0 


i r 


yR9Q47 


Me yAA74 


yap juiH«uuii proicin, oipna i « hoisu ycoiin 


i? 7 




nR.'44fin 

UOOHOU 




lflAAni4f\ nmfAin 


4 A 




AA313Q77 


Hs 179779 


uanawipuun cnnyauun lociur d ^oiii/i pu 


Q (\ 


I33C70 


W94333 




Iranslncnep nf innpr nnifnr4innririAl mpmhr 

UCU13iUl«09C UI UIIICI tlUlUWiimiUlMll IIICIIUII 




lodOr / 


PfW717 
rXJOt If 


Hs 7Rnfi') 
n9.f 3UOO 


nuiTidn II 1 ill lunuueiicwiu^ viiub ijiic i xsnn 


7 4 




1 373RR 


Ue 7Rin4 


r\iNM~uinuiiig piuicin o i| aciiiic-iiwi uuiii 

ntunyalaco 1 
yiyuAcuaoc 1 


0 


133A0A 
tOOOUO 


D13315 


Ue 75907 


4 9 


133617 


AA1 48318 


Hs.75249 


K1AA0089 protein 


4.5 


133627 


U09587 


Hs.75280 


glycyl-lRNA synthetase 


10 


133633 


D21262 


Hs.75337 


nucleolar phosphoprotein pi 30 


4.5 


133634 


U24166 


HsJ234279 


microtubule-associated protein; RP/EB fa 


15.2 


133640 


D83004 


Hs.75355 


ubiquitin-conjugating enzyme E2N (homo 


9.1 


133644 


D89077 


Hs.75367 


Src4ike^dapter 


6.4 


133649 


AA479139 


HS75393 


add phosphatase 1; soluble 


4.8 


133652 


AA287383 


Hs.7540 


ESTs 


4.2 


133674 


AA458946 


Hs.75497 


ESTs . 


4.3 


133700 


KOI 396 


Hs.297681 


protease Inhibitor 1 (anti-elastase); alpha- 


8.3 


133705 


N21648 


Hs.75659 


MpV17 transgene; murine homolog; glom 


4.6 


133716 


Y00282 


Hs.75722 


nl)ophorin 11 


. 7.5 


133720 


L27841 


Hs.75737 


pericentriolar material 1 


9.4 


133752 


U49278 


HSJ5875 


ubiquitin-conjugating enzyme E2 variant 


4.5 


133765 


D21255 


Hs.75929 


cadherin 11 (OB-cadherin; osteoblast) 


6.4 
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133772 W73693 Hs.76038 
133774 Z23090 Hs.76067 
133776 J03473 Hs.177766 
133784 AA214305 Hs.301064 
133814 M33882 Hs76391 
133829 AA453783 Hs.76550 
133834 AA147510 Hs.288660 
133839 M59815 Hs.170250 
133842 U73477 
133845 T68510 
133859 U86782 

133867 043948 

133868 U58090 
133871 AA454597 Hs.1 82793 
133893 X01060 Hs.77356 
133914 N32811 
133918 W72783 
133944 AA045870 Hs.7780 
133946 AA156565 Hs.173878 
133963 L34587 Hs.184693 
133980 D00760 Hs.250811 
133990 C02374 
133999 M28213 
134030 J03077 
134032 Z81326 

134045 S82470 

134046 D28473 
134064 D87685 
134070 H98521 
134087 U51166 
134090 M22382 
134098 X06323 
134110 U41060 
134132 U32519 
134168 AA398908 Hs.181634 
134170 M63138 Hs-79572 
134208 U88871 
134258 L28010 
134288 AA430008 H5.8117 
134310 M313414 Hs.8148 
134326 U16306 
134329 D38551 
134331 



Hs.285013 

HS.767G4 

Hs.178761 

Hs.76989 

Hs.183874 



Hs.77542 
Hs.58382 



Hs.7822 
Hs.78305 
HS78575 
Hs.78589 
Hs.78768 
Hs.1 72801 ■ 
Hs.78893 
Hs.78946 
Hs.1 73824 
Hs.79037 
Hs.79086 
Hs.79136 
Hs.220689 



Hs.79993 
Hs.80d 



Hs.81800 
Hs.81848 
AA452020 Hs.1112i22 
R82074 Hs.82109 



134351 

134357 L43575 Hs.82171 

134363 M37033 Hs.82212 

134357 X54199 Hs.82285 

134374 D62633 Hs.8236 

134375 AA412720 Hs.82389 

134376 X02874 Hs.82396 
134381 U56637 Hs.184270 
134388 .M15841 Hs.82575 
134395 L09717 Hs.8262 
134399 H99801 Hs.82689 
134401 AA243746 Hs.211577 
134405 J04177 Hs.82772 
134415 AA329274 Hs.82911 
134417 D87969 Hs.82921 
134419 L08044 Hs.82961 
134421 AA122386 Hs.82g85 
134423 W98151 Hs.83006 
134438 AA449S84 Ks.246857 
134446 T25732 Hs.83419 
134453 X70683 Hs.83484 
134470 X54942 Hs.83758 
134487 R38185 . Hs.83954 
134495 D63477 Hs.84087 



isopentenyl-diphosphate delta isbmenase 
heat shock 27kD protein 1 
ADP-ribosyltransferase (NAO; poly (AD 
ESTs 

myxovirus (influenza) resistance 1; homol 

Homo sapiens mRNA; cONA OKF=Zp564 

serine protease; umbilical endothelium 

complement component 4A 

putative human Hl^ dass II associated p 

ESTs 

26S proteasome-assodated padi homotog > 
K1AA0097 gene produd 
cullin 4A 
ESTs 

transfemn receptor (p90: CD71) 
ESTs 

ESTs; Weakly similar to C13F10.5 [C.ele 
Homo sapiens mRNA; cDNA DKFZp564 
4-nitrophenytphosphatase domain and non 
transcription elongation factor B (Sill); po 
"^proteasome (prosome; macropain) subunit 
Homo sapiens mRNA; cDNA DKFZp564 
RAB2; member RAS oncogsDe family 
prosaposin (variant Gaucher disease and v 
protease inhibitor 12 (neuroserpin) 
BB1 

isoleucine-tRNA synthetase 
K1AA0244 protein 
cullin 3 

thymine-ONA glycosylase 
heat shock BOkD protein 1 (chaperonin) 
ribosomat protein; mitochondrial; L3 
UV-I protein: estrogen regulated 
Ras^TPase-activating protein SH3-doma 
Human Chromosome 16 BAG clone CIT9 
cathepsin D (lysosomal aspartyl protease) 
peroxisomal biogenesis fador7 
heterogeneous nudear ribonudeoprotein F 
ESTs 

Homo sapiens clone 24856 mRNA sequen 
chondfoitin sulfate proteoglycan 2 (versic 
RAD21 (S. pombe) homotog 
ESTs; Weakly similar to CGM28 protein 
syndecan 1 

Human done 191B7 placenta expressed m 
CD53 antigen 

phosphoribosylgtydnamide fonnyltransfe 
ESTs 

ESTs; Highly similar to CGl-118 protein 
2*;5'-oligoadenylate synthetase 1 
capping protein (actin filament) musde Z- 
small nudear ribonudeoprotein polypepti 
lysosomal-assoctated membrane protein 2 
tumor rejection antigen (gp95) 1 
kinedin 1 (kinesin receptor) 
collagen; fype XI; alpha 1 
protein tyrosine phosphatase type IVA; m 
solute canier temily 35.(CMP-sianc add t 
trefoil f3dor.3 (intestinaQ 
collagen; type V; alpha 2 . 
ESTs; Highly similar to CGl-139 protein 
ESTs; H^h^ similar (o pratetne kinase JN 
KIAA0252 protein 

SRY (sex detemnlning region Y)-box 4 
COC28 protein kinase 2 
Homo sapiens unknown mRNA 
K1AA0143 protein 
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7.9 

4.1 

13 

5.2 

11.7 

9.4 

4.8 

6.7 

7.1 

6.3 

13.7 

4.1 

4 

4.7 
8.3 
5 

4.5 

6.3 

6.4 

6.3 

11.9 

8.2 

5.2 

4.6 

6.5 

11.9 

5.2 

7.3 

4.7 

7 

4.5 

9.4 

4.4 

6.6 

8.6 

9.3 

6.3 

4.3 

6.9 

7.4 

6.1 

8.6 

6.1 

4.4 

6.6 

5.3 

4.8 

15.2 

7.2 

6.4 

4 

5.7 

6.9 

4.5 

11.2 

15.3 

4,1 

4.2 

5.9 

5.8 

4.4 

7 

4.6 
5.1 
20.3 
5 . 
16.1 
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134498 


M63180 . 


Hs.84131 


threonyl-tRNA synlhetase 


6.1 


134506 


U45328 


Hs.84285 


ubiquitin-conjugating enzyme E21 (homol 


4,6 


134529 


H24460 


Hs.848 


FK508^inciing protein 4 (59kD) 


6,2 


134570 


U66615 


Hs.172280 


SWl/SNF related: matrix associated; actin 


4.8 


134582 


AA234966 


Hs.86041 


CGG triplet repeat binding protein 1 


4.7 


134600 


R68884 


Hs.86347 


ESTs; Weakiy similar to predicted using G 


5.8 


134623 


X74m 


Hs.86978 


pTDlyi endopaplidase 


4.5 


134654 


W23625 


Hs.8739 


ESTs: Weakly simUar to ORF YGR200c [ 


13.7 


134655 


AA454070 


Hs. 123090 


ESTs 


5.8 


134675 


AA250745 


Hs.87773 


protein kinase: cAMP-dependent: catalytl 


8.9 


134711 


X04011 


HS.S897A 


cytochrome b-245; beta polypeptide (chro 


6.8 


134714 


U89922 


Hs.890 


lymphotoxin beta (TNF superfamily; mem 


35.7 


134722 


W47183 


Hs.284226 


ESTs; Weakly similar to neural F box pro 


8.1 


134776 


J05582 


Hs.89603 


mucin 1; transmembrane 


6.2 


134806 


Z49099 


Hs.89718 


spermine synthase 


4.2 


134810 


M27394 


Hs.89751 


membrane-spanning 4-domains: subfamily 


7 


134840 


U51477 


Hs.89981 


diacytgiycerol kinase; zeta (104kD) 


4.1 


134843 


H60595 


Hs.90061 


progesteione binding protein 


4.7 


134853 


D82348 


Hs.90280 


5-aminoinitdazote-4-carboxamide ribohuc 


10.2 


134866 


U84011 


Hs.904 


amyk)-1;6*9luoo5idase; 4-aipfia-glucanofr 


12.1 


134868 


Z39762 


Hs.90419 


KtAA0882 protein 


6 


134885 


N27670 


Hs.9071 


progesterone membrane binding protein 


5 


134982 


N46086 


Hs.92308 


ESTs 


4.1 


134989 


AA236324 


Hs.92381 


Homo sapiens mRNA; chromosome 1 spe 


16.8 


134992 


H05625 


Hs.5831 


ESTs 


4 


134993 


AA282343 


Hs.301005 


purine-rich element binding protein B 


4.4 


135010 


D59675 


Hs.92927 


ESTs 


7 


135015 


U54999 


Hs.278338 


LGN protein 


4.8 


135029 


AA224180 




ESTs; Moderately similar to 17-beta-hydr 


13.6 


135032 


AA243497 


Hs.173685 


Human DNA sequence from clone 30M3 


4 


135037 


U77948 


Hs.278589 


general transcription factor 11; i 


8 


135059 


AA598449 


Hs.93832 


Homo sapiens clone 24483 unknown mRN 


5.4 


135071 


L08069 


Hs.94 


heat shock protein; DNAJ-like 2 


9.3 


135083 


AA495950 


Hs.94262 


ESTs 


6.7 


135117 


W52493 


Hs.946g4 


Homo sapiens clone 24837 mRNA sequen 


10.2 


135144 


AA044842 


Hs.95260 


Homo sapiens mRNA; cDNA DKFZp586 


6.6 


135154 


AA126433 


Hs.267812 


sorting nexin 4 


7.4 


135218 


D31157 


Hs.324277 


ESTs; Weakly similar to growth factor-res 


6.2 


135237 


AA454930 


Hs.9691 


ESTs 


19.5 


135243 


AA215333 


Hs.97101 


putative G piotein-coupled receptor 


8.8 


135335 


H20989 


Hs.198281 


pyruvate kinase; musde 


12.4 


135349 


D83174 


Hs.9930 


collagen-binding protein 2 (coUigen 2) 


5.5 


135367 


AA4801Q9 


Hs,9963 


TYRO protein tyrosine kinase binding pro 


5.4 


135389 


U05237 


Hs.99872 


fetal Abheimer antigen 


7.8 


135400 


M23263 


Hs.99915 


androgen receptor (dihydrotestosterone re 


9.1 


135411 


L10333 


Hs.99947 


reticulon 1 


5.3 


300019 


M97935 




AFFX control: STAT1 


8.3 


300021 


M97935 




AFFX control: STAT1 


7 


300022 


M97935 




AFFX control: STAT1 


14 


300089 


A1199738 


Hs.208275 


ESTs; Weakly similar to 111! ALU CUSS 


9.1 


300107 


A1694585 


Hs.270464 


ESTs; Weakly simitar to illi ALU CUSS . 


7.4 


300254 


AW079607 


Hs.188417 


ESTs; Weakly similar to ZnT-3 [H.safrfen 


30.1 


300328 


AW015860 


Hs.224623 


ESTs 


11.9 


300549 


AA699328 


Hs.298119 


ESTs 


5.5 


300711 


A1492179 


Hs.166244 


ESTs; Weakly similar to cDNA EST yk40 


11 


3G0921 


AW293224 


Hs.232165 


ESTs 


11 


301124 


T79326 


Hs.298262 


ESTs; Weakly similar to dJ88J8,1 [H.sapi 


8.8 


301165 


N85789 


Hs.150186 


ESTs; Weakly similar to PTERIN-4-ALP - 


6 


301576 


A1682905 


Hs.270431 


ESTs; Weakly simflar to 111! ALU SUBFA 


4.7 


301604 


AA373124 


Hs.24809 


ESTs; Weakly similar to C17G10.1 [CJete 


8 


301704 


AA526313 


Hs^93691 


ESTs 


4.2 


301782 


N99399 


Hs. 143046 


EST cluster (not in UniGene) with exon h 


18 


301884 


AA312082 


Hs.105445 


GDNF family receptor alpha 1 


20.7 


301936 


NI\/L004694Hs.1 14924 


EST cluster (not in UniGene) with exon h 


11.6 


302002 


AF013955 


Hs.5637 


chromobox homolog 4 (Drosophila Pc da 


9.2 


302032 


NM_001992Hs.1280B7 


EST cluster (not in UniGene) with exon h 


4.3 


302067 


K05698 


Hs.222399 


ESTs; Wealdy similar to protein-tyrosine 


7.8 
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302145 NtuL003613Hs.1514O7 EST duster (not in UniGene) with exon h 15.1 

302236 AI126606 Hs.6557 zinc finger proteih 161 25 J 

302276 NMJ004448HS.323910 EST duster (not in UniGene) with exon h 21.6 

302290 AL117607 Hs.175563 Homo sapiens mRI^; d)NA OKFZp564 41.4 

S 302326 NM_0042711Hs.184018 EST duster (not in UniGene) with exon h 8.9 

302342 AB023141 Hs.190386 KIAA0924 protein 5.4 

302372 AL117408 Hs.200102 Homo sapiens niRNA; dDNA OKFZp434 8.9. 

302422 AB021227 Ks.3743 mafrU metalloproteihase 24 (membrane-ih 5.2 

302431 AF129530 Hs.226434 EST duster (not in UniGene) with exon h 5.3 

10 302501 AF022726 Hs.251446 EST duster (not in UniGene) with exon h 9.9 

302505 A1049650 Hs.247874 multiple UniGene matdies 4.3 

302533 L36149 Hs.248116 chemoWne (C motif) XC receptor 1 4.9 

302638 AA463798 Hs.102696 ESTs; Weakly similar to C 11 D2.4 [C.eleg 5.3 

302656 AW293005 Hs.7O704 ESTs 8.4 

15 302792 AA343696 Hs.46821 ESTs; Weakly similar to putative [H.sapie 4.5 

302820 X04588 Hs.85844 EST duster (not in UniGene) with exon h 6.8 

302838 U66049 Hs.82171 EST duster (not in UniGene) with exon h 8.4 

302892 N58545 Hs.42346 histone deacetylase 3 22.8 

302977 AW263124 Hs.315111 EST duster (not in UniGene) with exon h 6,8 

20 302989 N46406 Hs.84700 EST duster (not in UniGene) with exon h 8.9 

303007 AA478876 Hs.317714 pallid (mouse) homolog; pallidin 10.1 

303052 AF140242 Hs.279926 EST duster (not in UniGene) with exon h 24.4 

303131 AW081051 Hs.103180 actin-like6 6.3 

303132 AI929819 Hs.4055 ESTs 17.7 
25 303153 U09759 H5.246857 mitogen-adivated protein kinase 9 11.4 

303387 AA908797 Hs.180799 ESTs 15.8 

303499 A1815990 Hs.293515 ESTs 7.2 

303502 AA488528 EST duster (not in UniGene) with exon h 5.3 

303576 T07216 Hs.301226 EST duster (not in UniGene) with exon h 16.2 

30 303620 AA397546 Hs.1 19151 ESTs 8.9 

303634 Ai953377 Hs.28444 ESTs; Weakly similar to predided using G 12 

303642 AW29g459 Hs.1 1 1977 EST duster (not in UniGene) with exon h 4.2 

303654 AA436942 Hs.28852g ESTs 8.4 

303733 AW502498 Hs.15220 ESTs: Weakly sfmilar to zinc Ongerprote 5.2 . 

35 303780 AI424014 Hs.18995 ESTs; Moderately simHar to K1AA0456 p 28.4 

303792 C75094 Hs.199839 ESTs; Highly similar to NG22lH.sapiens 4.4 

303842 AI337304 Hs.126268 ESTs; Weakly similar to similar to PDZd 8.1 

303951 AW475081 Hs,172928 collagen; type I; alpha 1 7.5 

304465 AA421948 EST singleton (not in UniGene) with exon 6.5 

40 304507 AA456426 EST 5.4 

304591 AA505702 EST singleton (not in UniGene) with exon 9.8 

304601 AA507875 EST singleton (not in UniGene) with exon 7.5 

304659 AA533185 EST singleton (not in UniGene) with exon 7 

305040 AA630582 Hs.169476 glyceraldehyde-3-phosphate dehydrogena 12.4 

45 305134 AA653159 Hs. 179661 EST singleton (not in UniGene) with exon 8.7 

305415 AA725116 Hs.78465 EST singleton (not in UniGene) with exon 5.3 

305453 AA738110 EST singleton (not In UniGene) with exon 4.1 

305898 AA872838 keratin 8 7.7 

305913 AA876109 EST singleton (not in UniGene) with exon 6.3 

50 305950 AA884479 EST singleton (not in UniGene) with exon 5.6 

306004 AA889992 Hs.2186 EST singleton (not in UniGene) with exon 13.2 

306009 AA894560 Hs.283370 EST singleton (not in UniGene) with exon 4.4 

306080 AA908161 Hs.76277 EST singleton (not in UniGene) with exon 4.6 

306398 AA970548 Hs.297681 EST singleton (not in UniGene) with exon 7.6 

55 306505 AA987722 Hs.172928 EST singleton (not In UniGene) with «con 19.7 

306576 AA995761 Hs.276092 EST singleton (not in UniGene) with exon 5.5 

307117 AI184111 Hs.76067 heat shod«27kD protein 1 7.7 

307138 A1185516 Hs.172928 collagen: type I; alpha 1 f..8 

307187 AI190870 Hs.276417 EST singleton (not in UniGene) with exon 4.1 

60 307542 A1280859 Hs.62954 EST singleton (not in UniGene) with exon 6 

307554 AI281603 Hs.172928 EST singteton (not in UniGene) with exon 10.8 

307806 A1351739 Hs.276726 EST singleton (not in UniGene) with exon 4.7 

308079 AI472733 Hs.270208 ESTs 4.2 

308307 AI581398 Hs.172928 collagen; type I; alpha 1 5.4 

65 308511 AI687580 Hs.169476 EST singleton (not in UniGene) with exon 10.1 

308615 AI738593 Hs.101774 EST singleton (not in UniGene) v/ith exon 15.1 

308677 AI761173 EST singleton (not In UniGene) with exon 4.6 
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308flR9 AtA99A4A Hs 
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5.9 




immtinnntnhiiHn namma ^ l(^rn maricor^ 


4.5 


«JUO;70 1 rVIOf 0£rr^ 


PQT einntofnn (nni in i lnif?pnp\ utrifh ov/tn 

cot OllltjlSlUII ^IIUl Ul UIIIOCIIC^ WIUI CaUII 


7.6 


3QaQQ*i AIRfl0179 


PQT etnnlot/^n fnnt in 1 lntf?pnp^ u/tfh pynn 
CO 1 aiiiytciun ^liui iii \,iiiiwd>c^ wiui cauii 


6.6 


'^OQITT AIQ*^111ft 
1 1 1 r\i3U 1 1 1 0 


PQT ctnnlofnn fnnt tn 1 Inif^onoA uflfh ovnn 
COI allltjiClUII ^IIUL IN UlllwdlCy WiUI CA,U[I 


24.3 




PQT cinnlotnn fnnt tn 1 lnlr?pnp\uri}h OYon 


6.1 




msirtr hicfrtmmnsiihtfHxi ri^ninlpy* Haee 1* 

IllaJUi IlldvUUUII^/OUUIlUj \AilllipiBA| UlcldO 1, 


5.6 




PQT einnlptnn /nnt in 1 Inif^pnpX urifH pvnn 

CO 1 OllliJIclOJI \IIU1 III \Jtil\DlSttfSf WIUI CAUII 


6.2 




PQT einnlptnn fnnt in 1 lnif^pnp\ with Pvnn 
CO 1 aiiiyicUJii (iiui 111 uiiiodtc/ wmi cauii 


7.9 




PQT 


64.5 




PQT 

CO 1 
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PQTe 
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9A 1 
Zl. 1 
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fi A 

O.H 


ollf74 AA/UOofO nS.lHoUA 


bolS 


ft 9 


oll/ot) AlUdbfby ns.ljooiz 


bolS 


3 


o11yJo lbUo4o . nS.loybry 


bo IS 


3.y 


311930 AAz1d3o7 


col dustsr (not in uniiaene) 


3.3 


311972 N51511 ns.l 88449 


coTS 


R 9 
3.Z 
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cols 




312047 AAoooZ7o nS.loODb9 


coTS 


1A 7 
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bo 1 Cluster (not in uniuene) - 


OR 

y.o 


3 1 z 1 3 J AA/ syzou ns.l oouzo 


cytochroine b-561 


97 1 
Z/. 1 


1191fiR TQ99'\1 Hc 1QRRR9 
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R 1 
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jizzzo AuyDoi3 ns.iyyyyo 


bo 1 Sf wesKiy siniuar to uDiquRous i rr< 


R R 
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312837 AW292286 Hs.255058 


ESTs 


7.1 


312849 AA846353 Hs.194054 


ESTs 


5.9 


3iz8d4 AAozo713 nS.3zl03o 


EST cluster (not in Um'Gene) 


4,1 


312992 AA088446 Hs.170298 


ESTs 


7.3 


313096 AI422367 Hs,163533 


ESTs 


6.1 


313112 AA732534 Hs.269099 


ESTs 


4.2 


313126 AA720887 Hs.283313 


EST cluster (not in UniGene) 


18.1 


313136 N59284 Hs.288010 


ESTs 


17 


313197 AI738851 Hs.222487 


ESTs 


12.9 


313219 N74924 Hs.182099 


ESTs 


7.1 


313258 AW068358 Hs.183918 


ESTs 


13.7 


313328 AW449211 Hs.105445 


ESTs 


27.9 


313352 AW292127 Hs.144758 


ESTs 


9.8 


313417 AA741151 Hs.137323 


ESTs 


8.2 


313455 AW081702 Hs.98571 


ESTs 


6.9 
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313590 AA804410 Hs.291677 


EST duster (not in UniGene) 


5.3 


313663 Ai953261 Hs.169813 


ESTs 


7.6 


313667 U69201 Hs.13684 


ESTs; Weakly similar to choline Wmase is 


12.5 


313749 AW450376 Hs.119004 


ESTs 


5.5 


313832 AW271022 Hs.133294 


ESTs 


4,3 


313881 AA535580 Hs.16331 


ESTs 


7,7 


313915 AI969390 Hs.163443 


ESTs 


27.1 


313955 AI858884 Hs.270647 


ESTs 


5.7 


313974 AI310151 Hs.173524 


ESTs 


4.3 


314097 AA648744 Hs.269493 


ESTs 


14.5 


314129 AA228366 Hs.115122 


ESTs 


9.5 


314359 AA205569 Hs.194193 


ESTs 


5.4 


314384 AA535840 Hs.162203 


ESTs; Weakly similar to alternatively spli 


5.3 


314394 A1380563 Hs.130816 


ESTs 


13,2 


314462 AA347951 Hs.326413 


ESTs 


6.2 


314465 AA602917 Hs.156974 


ESTs 


18.1 


314470 AI934422 Hs.3a661 


ESTs • 


4.2 


314488 AA358265 Hs.182890 


ESTs 


6.1 


314506 AA833655 Hs.206868 


ESTs 


27.8 


314510 At204418 Hs.190080 


ESTs 


9.5 


314558 A1873274 Hs.190721 


ESTs 


22.5 


314661 AA436432 Hs.324239 


EST cluster (not in UniGene) 


13.3 


314691 AW207206 Hs.136319 


ESTs 


21.4 


314754 AW025761 Hs.134374 


ESTs 


4.4 


314775 AI149880 Hs.188809 


ESTs 


4.4 


314943 AI476797 Hs.184572 


cell division cyde 2; G1 to S and G2 to M 


18.4 


314961 AW008061 Hs.231994 


ESTs 


10.2 


314963 AI689617 Hs.20O934 


ESTs 


5.3 


315006 A1538613 Hs.29824l 


ESTs 


20.7 


315010 AA531082 Hs.240049 


ESTs 


5 


315019 AA532807 Hs.105822 


ESTs 


6.1 


315033 A1493046 Hs.146133 


ESTs 


12 


315036 AA534953 Hs.l63297 


ESTs 


8.3 


315037 AW205863 Hs.133988 


ESTs; Weakly similar to gene MAC25 pr 


6.1 


315051 AW292425 Hs.163484 


EST 


12.7 


315054 AI968598 Hs.78768 


ESTs 


7.6 


315073 AW452948 Hs.257631 


ESTs 


13.9 


315080 AA744550 Hs.136345 


ESTs 


4.4 


315083 AI221325 Hs^05442 


ESTs 


5.1 


315088 AA557351 Hs.152448 


ESTs; Moderately simPar to MULTIFUN 


4.7 


315175 AI025842 Hs.152530 


ESTs 


11.9 


315196 AA972756 Hs.44898 


ESTs 


28.8 


315298 AA876905 Hs.125286 


ESTs 


16.1 


315303 AW194364 Hs.128022 


ESTs; Weakly similar to FIG-1 PROTEIN 


25.7 


315352 AA604799 Hs.136528 


ESTs; Moderately similar to !f!! ALU SU 


12.3 


315364 AA643602 Hs.155485 


ESTs; Highly similar to serine protease [H 


4.6 


315368 AW291563 Hs.104696 


ESTs 


4.8 


315390 AI801565 Hs.200113 


ESTs: Weakly similar to alternatively spli 


4.4 


315408 AW273261 Hs.216292 


ESTs 


5 


'315458 AA872000 Hs.1 16104 


ESTs 


7.6 


315472 AA828850 Hs.165469 


ESTs 


4.9 


315478 AA665612 Hs.120874 


ESTs 


5.2 


315498 AA628539 Hs.116252 


ESTs; Moderately similar to 1111 ALU SU 


4.8 


315527 AI791138 Hs.116768 


ESTs 


4^4 


315530 AI200852 Hs.127780 


ESTs 


22.4 


315562 AA737415 Hs.152826 


ESTs 


5.9 


315634 AA837085 Hs.220585 


ESTs 


8.8 


315647 AA648983 Hs.212911 


ESTs 


15 


315652 A1521489 Hs.3053 


ESTs 


6.3 


315676 AW002565 Hs.124660 


ESTs 


9.2 


315680 AA814309 Hs,123583 


ESTs 


8.1 


315735 A1831760 Hs.1551t1 


ESTs 


13.4 


315741 AA812168 Hs.122559 


ESTs 


5.4 


315769 AA744875 Hs.189413 


ESTs 


4.4 


315978 AA830893 Hs.1 19769 


ESTs 


10.4 


315984 AI015862 Hs.131793 


ESTs- 


5 


316042 AW297979 Hs.170698 


ESTs 


14.7 
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316136 


AA830808 


Hs.1 24356 


ESTs 


4 


316177 


AI908272 


Hs.293102 


EST cluster (not in UniGene) 


32.6 


316313 


AA741300 


Hs.202599 


ESTs 


4.8 


316405 


AA757900 


Hs.270823 


ESTs 


4.8 


316480 


AI749921 


Hs.205377 


ESTs 


12.9 


316564 


AI743571 


Hs.168799 


ESTs: Weakly similar to !!!! ALU SUBFA 


8.1 


316714 


AA809792 


Hs.123307 


ESTs 


5 


316715 


A1440266 


Hs.170673 


ESTs 


4.2 


316828 


AA828116 


Hs.173076 


ESTs 


5.2 


316869 


Ai954880 


Hs.134604 


ESTs 


13.3 


316905 


AW1 38241 


Hs,210846 


ESTs 


6.2 


316943 


AW014875 Hs.137007 


ESTs 


5.3 


316949 


AA856749 


Hs. 124620 


ESTs 


7.2 


317008 


AW051597 Hs.143707 


ESTs 


4.1 


317028 


AA962623 


Hs.189144 


ESTs; Weakly similar to RENAL SODIU 


4.2 


317067 


AI805392 


Hs.325335 


ESTs 


4.5 


317059 AI732892 


Hs. 190489 


ESTs 


6.4 


317210 AA490718 




EST cluster (not in UniGene) 


4.4 


317298 


AI922374 


Hs.158549 


ESTs 


5.9 


317658 


AW139077 Hs.202217 


ESTs 


4.6 


317674 AW294909 Hs.132208 


ESTs 


5.2 


317685 


AI798630 


Hs.149997 


ESTs 


4.3 


317836 AAg83913 


Hs.128929 


ESTs 


12.4 


317881 


AI827248 


Hs.224398 


ESTs 


12,1 


317902 


A1828602 


Hs.211265 


ESTs 


6.8 


317916 


AI565071 


Hs.159983 


ESTs 


12.6 


318042 


AW294522 Hs. 149991 


ESTs 


5.6 


318053 


A)074465 


Hs.133469 


ESTs 


4 


318064 


AW296888 Hs.170939 


ESTs 


5.2 


318070 


AI024594 


Hs.248942 


ESTs 


4.7 


318073 


AW167087 Hs.131562 


ESTs 


15.7 


318146 


AI040125 


Hs.150521 


ESTs 


5.9 


318186 


AW016773 Hs.3709 


ESTs 


5.3 


318481 


A1291584 


Hs.145921 


ESTs; Weakly similar to HYPOTHETICA 


7.6 


318566 


AI335361 


Hs.226376 


ESTs 


5.8 


318617 


AW247252 Hs75514 


nucleoside phosphorylase 


11.1 


318662 


AI285898 


Hs.294014 


ESTs 


16.3 


318691 


AW192139 Hs.181307 


H3 histone; family 3A 


4 


318740 


NM_002543Hs.77729 


EST cluster (not in UniGene) 


21.3 


318744 


AI793124 


Hs.144479 


ESTs 


35 


318948 


AA317274 


Hs. 13995 


ESTs 


11.7 


319163 


F15257 


Hs.27 


glycine dehydrogenase (decarboxylating; 


7 


319478 


R06d41 


Hs.270307 


EST duster (not in UniGene) 


8.9 


319545 


RB3716 


Hs.14355 


ESTs 


8.2 


319668 NM„002731Hs.87773 


EST cluster (not in UniGene) 


25.4 


319763 


AA460775 


Hs.6295 


ESTs 


7 


319913 AA179304 


Hs.271586 


ESTs; Moderately similar to IHI ALU SU 


8.7 


319936 


W22152 


Hs.2d2929 


EST duster (not in UniGene) 


5.6 


319951 


AA307665 


Hs.14559 


ESTs 


4.9 


319962 


H06350 


Hs,135056 


ESTs 


9.2 


319977 


AA632632 




EST cluster (not in UniGene) 


4.6 


320074 


AA321166 


Hs,278233 


EST cluster (not in UniGene) 


16.7 


320092 


AF022799 


Hs.113292 


calpain 9 (nCL^) 


5.4 


320107 


AA836461 


Hs.291712 


EST cluster (not in UniGene) 


5.3 


320133 D63271 




EST cluster (not in UniGene) 


5.5 


320167 


AA984373 


Hs.90790 


EST cluster (not in UniGene) 


15 


320187 


T99949 


Hs.303428 


EST duster (not in UniGene) 


6.7 


320211 


AL039402 


Hs.1 25783 


DEME-6 protein 


24.3 


320401 


U90449 


Hs.152717 


nudeoside diphosphate kinase type 6 (inh 


10 


320458 


AI884396 


Hs.24131 


ESTs 


5.4 


320488 


R31386 


Hs.191791 


EST duster (not in UniGene) 


4.9 


320521 


N31464 


Hs.24743 


ESTs 


9.5 


320661 


AA864846 


Hs.1 15175 


EST duster (not in UniGene) 


6.6 


320691 


R61576 


Hs.313951 


hypothetical protein 


5.9 


320699 


R63161 


Hs.1 18249 


EST duster (not in UniGene) 


4 


320727 


U96044 


Hs.181125 


EST duster (not In UniGene) 


•15.3 


320993 ALJ050145 


Hs.225986 


Homo sapiens mRNA;cDNADKFZp586 . 


7.2 
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321012 AA737314 Hs.194324 

321050 AW393497 

321051 AF134149 Hs.240395 
321171 A(769410 Hs.221461 

5 321192 AA295304 Hs.297939 
321354 AA078493 
321387 H66014 Hs.141278 
321412 AW366305 Hs.22891 
321489 AW392474 Hs,172759 

10 321539 N98619 Hs.42915 
321593 H84762 Hs.253197 
321666 D28390 Hs272m 
321891 AW157424 Hs.165954 
321910 H67065 Hs.271530 

15 321953 AW068268 Hs.292833 
321978 N77342 Hs.21851 
322017 AA310039 Hs.9192 
322026 AA233527 Hs.283675 
322035 AL137517 Hs.306201 

20 322171 AF085968 Hs.48474 
322175 AF085975 
322236 AL134970 Hs.104222 
322303 W07459 Hs.l57601 
322735 AA086123 Hs.297856 

25 322777 AA679082 Hs.269947 
322818 AW043782 Hs.293616 
322882 AW248508 Hs.279727 
322975 C16391 
322991 C18965 Hs.1 59473 

30 323011 AA580288 

323091 AW014094 Hs,210761 
323107 AI301107 Hs.150790 
323136 AL120351 Hs.30177 
323168 AL120862 Hs.124165 

35 323195 AI064982 Hs.117950 
323201 AL049370 Hs.13350 
323203 AA203135 Hs.130186 

323243 W44372 Hs.1 10771 

323244 T70731 Hs.193620 
40 323328 AA228078 Hs.255096 

323332 AIB29520 Hs.227513 

323333 AA228883 Hs.208558 
323570 AL038623 Ks.208752 
323604 AI751438 Hs.41271 

45 323685 AA344205 Hs.289088 
323753 AA327102 H&70266 
323817 AA410943 
323845 Ai684674 Hs.41127 
323930 AA570698 Hs.8173 

50 323997 AA844907 Hs,274454 
324047 AA378201 Hs.271340 
324261 AL044891 Hs.269350 
324302 AA643008 Hs.292471 
324338 AL138357 Hs.145078 

55 324344 AW502000 Hs.46677 
324432 AA464510 Hs.152812 
324495 AW501411 Hs.122489 
324497 AW152624 Hs,136340 
324598 AA502659 Hs.163985 

60 324603 AW016378 Hs.292934 
324620 AA448021 Hs.94109 
324727 AI610425 Hs,19597 
324774 A1031771 Hs.132586 
324783 AA640770 Hs.200994 

65 324824 AI826999 Hs.224624 
324826 AA704806 Hs.143842 
324902 D31323 Hs.271492 



EST duster (not in UniGene) 
EST duster (not in UniGene) 
EST duster (not in UniGene) 
eSTs 

ESTs; Weakly similar to neogenin.fH^p 

EST duster (not in UniGene) 

ESTs; Weakly similar to Wll ALU SUBFA 

EST duster (not in UniGene) 

ESTs; Moderately similar to !!!! ALU SU 

ARP2 (actin-related pre>tein 2; yeast) hom 

ESTs 

EST duster (not in UniGene) 
ESTs 

ESTs; Weakly similar to ill! ALU SUBFA 
ESTs; Weakly simHar to !!!! ALU CLASS 
EST duster (not in UniGene) 
ESTs 

low density lipoprotein receptor (famifial 
EST duster (not in UniGene) 
EST duster (not in UniGene) 
EST duster (not in UniGene) 
follistat'n-like 1 

EST duster (not in UniGene) 
EST duster (not in UniGene) 
ESTs 
ESTs 

DiGeorge syndrome critical region gene 2 

EST cluster (not in UniGene) 

ESTs 

EST cluster (not in UniGene) 

ESTs 

ESTs 

EST duster (not in UniGene) 
ESTs 

multifunctional polypeptide similar to SA 
Homo sapiens mRNA; cDNA DKFZp586 
ESTs 

EST duster (not in UniGene) 
EST cluster (not in UniGene) 
EST duster (not in UniGene) 
ESTs 

EST duster (not in UniGene) 

ESTs; Weakly similar to WW ALU SUBFA 

EST?; Weakly similar to HI! ALU SUBFA 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; WeaWy similar io waclaw [Cmelan 

ESTs 

EST cluster (not in UniGene) 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; Weakly similar to Mil ALU SUBFA 

ESTs 

EST duster (not in UniGene) 

EST duster (not in UniGene) 

ESTs; Weakly similar to III! ALU CLASS 

ESTs 

ESTs 

ESTs 

EST duster (not in UniGene) 

ESTs 

ESTs 

EST duster (not in UniGene) 

ESTs 

ESTs 

ESTs 
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5 

11.4 
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5.5 

16.9 
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6.3 

9 
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10.4 

19.9 

5.6 

5.4 

6.5 

10.2 

9.8 

27.8 

40.2 

5.7 

7.7 

14.4 

13.4 

7.6 

4.4 

21 

15.3 

21.3 

11.7 

8.9 

10.8 

6.5 

5.5 

17.9 

5.8 

11.6 

6.4 

7.3 

15.8 

4.8 

20,2 

8.8 

5 

6.5 

7.1 

6.1 

16.8 

10.1 

6.4 

8 

6.3 

50.1 

5.7 

9.5 

4.4 

167 

5.5 

5.4 

8.8 

23.1 

21.2 

5 

5 

4.1 
6.3 
11.7 
4.8 



oftterv-rMrv -won 



wo 02/059377 PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 
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13.3 


324987 


708882 


He 177R34 


ESTs 


19.6 


324988 T06997 


1910?& 


POT rliiQtPr (nni in 1 lnt(^np\ 


24.5 


325146 A1064690 




COIS> 
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TH 14 hs ai|(>A67n00 
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12.7 


326816 
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327196 
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A ft 
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PU no he rtii'^RftT7Rfi 
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328059 
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R 9 
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328492 
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328857 
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329367 
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7 R 
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329655 
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329899 
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4 ■ 


329960 






PU 1R n9 nII'%nQ'1»\Q4 
v/rl. IO_p^ yi|uu9 IOtH 


7fi 


330084 
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330384 


M23263 




anQiogen recepior ^ainyuroiesiosieione re 


0.0 


330385 AA449749 




CO 1 s, nig my simiioi lu scuicicu apopiDsi 


in 9 


330387 


H14624 - 




CO 1 nigniy similar lo SGcreiea apopiosi 




330388 


X03363 




ncn^ recepior qfrosine Kinase ^v-enjD-£, 


17 7 
l/.f 


330409 


D50692 


Hs.78221 


c-fiiyc uifiuing proiein 


in 1 


330460 


TIGR:HT544 




cnQon 


330486 


M13755 


Hs.833 


interferon-stimulated protein; 15 kDa 


Of 


330494 


M2g696 


Hs.237868 


interleukin 7 receptor 


c 
0 


330500 


M34423 


HSJ9222 


galactostdase; beta 1 


1 J.l 


330510 


M75099 


Hs.227729 


Cl/aOC Ktnrlinn nmtotn O /I '3l/n\ 

rKOUo-Dinuing proiem z (ioku) 


90 


330513 


M81057 


Hs.180884 


carboxypeptidase B1 (tissue) 


JO.O 


330541 


U22970 


Hs.265827 


muitipfe UntGene matches 


T A 

J A 


330542 


U23942 


Hs.226213 


cytocnrome P450; 51 (lanosterol i4-aipna 


15 


330547 


U32989 


Hs.1 83571 


tryptophan 2:3-dioxygenase 


A 'i 

n 


330551 


U39840 


Hs.299857 


hepatocyte nuclear factor 3; alpha 


0.0 


330562 


U49082 


Hs.76460 


transporter protein 


7 7 

1.1 


330573 


U62800 


Hs.83393 


cystatin E/M 


■A 


330673 


D57823 


Hs-321403 


Sec23 (S. cerevisiae) homolog A 


lO.O 


330711 


AA1 64687 


Hs.177576 


mannosyl {alpha-1;3-)-glycoprote]n beta-1 


Oil o 

24.0 


330814 


AA015730 


Hs.265398 


ESTs; WeaWy similar to tnansformation-f 


AA 1 


330850 


AA075298 


Hs,322710 


cols 


A A 
4 .1 


330874 


AA127474 


Hs.191157 


roT-.. nil Alii io^a 

ESTs; Weakly similar to ll!! ALU SUBFA 


0.1 


330884 


AA133457 


Hs.102548 


ESTs 


0.Z 


330912 


AA195936 


Hs.82719 


geneiBi iranscnption tactor llA, l (37KD a 


ft - 


330924 


AA232136 


Hs.159737 


Homo sapiens mRNA; cuNA uKrZp4j4 


9.1 


330997 


H55762 


Hs.9302 


COT— 

cSTs 


T A 

f .b 


331014 


H98597 


Hs.30340 


cots 




331024 


N32919 


Hs.27931 


cols 


Q 1 

9.1 


331046 


N66563 


Hs.191358 


CO IS 


in ^ 


331135 


R61398 


Hs.4197 


ESTs 


7,4 


331145 


R72427 


Hs.129873 


ESTs; Weakly similar to CYTOCHROME 


41.9 


331148 


R73816 


Hs.17385 


ESTs 


4.7 


331222 


T98531 


Hs.1 73904 


ESTs 


4.1 


331230 


W69807 


Hs.16537 


hypothetical protein; similar to (U08944) 


4.9 


331306 


AA252079 


Hs.63931 


dachshund (Drosophila) homolog 


15.1 


331327 


AA281076 


Hs.109221 


ESTs 


4.8 


331337 


AA287662 


Hs.50495 


ESTs 


7.6 


331341 


AA303125 


Hs.23240 


ESTs; Weakly similar to 111! ALU SUBFA 


13 


331344 


AA357927 


Hs.126550 


ESTs 


12.4 


331362 


AA4 17956 


Hs.40782 


ESTs 


6.5 


331363 


AA421562 


Hs.91011 


anterior gradient 2 (Xenepus laevts) homo 


28.2 


331376 AA443802 


Hs.41007 


ESTs; Weakly similar to cDNA EST yk47 


15.1 


331384 


AA456001 


Hs.93847 


. ESTs 


7.9 
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331478 N26608 


Hs.40639 


ESTs 


"7 




331526 N49967 


Hs.46624 


ESTs 


19.8 




331533 N51517 


Hs.47282 


ESTs 


0.5 




331681 W85712 


Hs.119571 


collagen; type (U; alpha 1 (Ehters-Danids 


13.8 


5 


3316B6 W88502 


Hs.182258 


ESTs 


9.9 




331750 AA284372 


Hs.1 11471 


ESTs 


5.6 




331761 AA284840 


Hs.143818 


ESTs 


5.8 




331760 AA292721 


Hs.154434 


ESTs; Weakly similar to unknown [H.sap 


7.4 




331763 AA312861 


Hs.96704 


ESTs 


7.8 


10 


331825 AA411144 


Hs,292862 


ESTs 


15.2 


331890 AA432166 


Hs.3577 


succinate dehydrogenase complex; subuni 


24.3 




331952 AA454756 


Hs.97837 


ESTs 


5 




332015 AA487910 


Hs.208600 


ESTs; Weakly similar to !!!! ALU CLASS 


10.5 




332043 AA490831 


Hs.125056 


ESTs 


11.4 


15 


332060 AA504779 


Hs,191402 


ESTs 


13,6 




332071 AA5g8594 


Hs.2aS293 


ESTs 


9.1 




332093 AA608794 


Hs,112592 


ESTs 


8.8 




332139 AA620669 


Hs.1 12879 


EST 


9 




332219 N22508 


Hs.139315 


ESTs 


7.1 


20 


332225 N33213. 


Hs.100425 


ESTs 


12.2 




332246 N57927 


Hs.120777 


ESTs; Weakly similar to RNA POLYME 


15.6 . 




332247 N58172 


Hs.109370 


ESTs 


16.9 




332260 N70088 


Hs.1 38467 


ESTs 


4 




332269 N91279 


Hs.109654 


ESTs; Moderately similar to outer membr 


8.2 


25 


332336 T96130 


Hs.137551 


ESTs 


7.7 




332340 W15495 


Hs.129781 


chromosome 21 open reading frame 5 


14.1 




332347 W60326 


Hs.288684 


ESTs 


4.4 




332362 W93640 


Hs.4779 


ESTs; Moderately similar to similar to AD 


16.9 




332467 AA489630 


Hs.1 19004 


K1AA0665 gene product 


4.8 


30 


332499 Ml 2035 


Hs.323910 


Human tyrosine kinase-type receptor (HE 


10.4 




332513 AA018182 


Hs.154424 


deiodinase; iodothyronine; type H 


5.8 




332526 AA281753 


Hs77515 


inositol 1 ;4;5-triphosphate receptor; type 


19 




332532 N63192 


Hs.1892 


EST; Highly similar to PHENYLETHAN 


15.3 




332565 AA234896 


Hs.25272 


El A binding protein p300 


12.3 


35 


332607 R41791 


Hs.36566 


LtM domain kinase 1 


11.1 




332640 AA417152 


Hs.5101 


protein regulator of cytokinesis 1 


18.2 




332694 AA262768 


Hs.243901 


K1AA1067 protein 


15.2 




332702 H93968 


Hs.75725 


transgelin 2 


4.7 




332705 T59161 


HS76293 


thymosin; beta 10 


5.5 


40 


332749 AA479g68 


Hs.88251 


aiytsulfotase A 


9.8 




332927 




CH22_FGENES.38_1 


17.7 




332929 




.CH22_FGENES.38_3 


4.7 




332930 




CH2^FGENES.38.4 


7.4 




332955 




CH2a.FGENES.48_12 


5.4 


45 


332958 




CH22 FGENES.48J5 


17.8 




332961 




CH22 FGENES.48_16 


10.6 




332983 




CH22 FGENES,54_5 


4.3 




333009 




CH22 FGENES.61J 


5.2 




333010 




CH22 FGENES.61.2 


8.1 


50 


333013 




CH22 FGENES.61_5 


8.5 




333108 




CH22 FGENES.79_14 


5.6 




333139 




CH22_FGENES.83^16 


6.3 




333254 




CH22 FGENES.118_2 


6.8 




333305 




CH22 FGENES.137^2 


11.4 


55 


333343 




CH22_FGENES.139J2 


5.1 




333388 




'CH22 FGENES.144.3 


12.7 




333456 




CH22„FGENES.157^5 


4.2 




333459 




CH22.FGENES.157.8 


7.6 




333517 




CH22.FGENES.173^2 


8.2 


60 


333585 




CH22.FGENES^03^4 


5 




333679 




CH22 FGENES.2473 






333743 




CH22 FGENES.264.1 


13,4 




333758 




CH22 FGENES.268J 


4 




333767 




CH22_FGENES.271,6 


5,6 


65 


333768 




CH22-FGENES.271.7 


12,2 


333769 




CH22.FGENES.2713 


48.3 




333795 




CH22_F6ENES^5J 


6.1 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



333796 


CH22_FGENES.275,3 


6.8 


333892 


CH22 FGENES.292J4 


4.4 


333904 


CH22_FGENES.294.2 


6.5 


333905 


CH22_FGENES^94_3 


9.3 


333921 


CH2a.FGENEa296^12 


9.6 


333966 


CH22_FGENES.307_4 


15.9 


334102 


CH22_FGENES.327_60 


7.1 


334222 


CH22 FGENES.360 3 


6.7 


334223 


CH22_FGENES.360_4 


33.5 


334264 


CH22_FGENES.367 15 


18.5 


334343 


CH22_FGENES.375_25 


6.1 


334360 


CH22_FGENES,378_5 


6.1 


334784 


CH22 FGENES.432^9 


4.8 


334789 


CH22_FGENES.432J4 


5.1 


334794 


CH22_FGENES.434_2 


7 


334889 


CH22 FGENES.452 3 


12.4 


335004 


CH22_FGENES.472^8 


7.9 


335115 


CH22 FGENES.496_2 


18.8 


335287 


CH22_FGENES.526_11 


4.5 


335342 


CH22_FGENES.536_1 


5.3 


335491 


CH22_FGENES.570J3 


24 


335495 


CH23_FGENES.570_28 


7 


335498 


CH22_FGENES.571_7 


12.2 


335544 


CH22 FGENES.576 5 


8.4 


335610 


CH22 FGENES.583 4 


12.9 


335653 


CH22 FGENES.590 4 


6.7 


335682 


CH22 FGENES.595 2 


12.1 


335687 


CH22 FGENES.596 2 


13.9 


335755 


CH22_FGENES,604_4 


11.5 


335782 


CH22 FGENES.609 4 


17.9 


335791 


CH22 FGENES.611_7 


27.3 


335809 


CH22 FGENES.617 6 


19.2 


335822 


CH22 FGENES.619 7 • 


19.1 


335823 


CH22 FGENES.619 8 


4.5 


335824 


CH22^FGENES.619_11 


40.2 


335825 


CH22 FGENES,619 12 


34.3 


335895 


CH22 FGENES.635^3 


10.2 


335917 


CH22 FGENES.636 13 


6 


335920 


CH2^_FGENES.636J6 


8.8 


336035 


CH22_FGENES.678_6 


5,9 


336042 


CH22^FGENES.679jl 


5.8 


336093 


CH22 FGENES.691_2 


11.6 


336096 


CH22_FGENES.691_5 


7.6 


338150 


CH22.FGENES.706.6 


6.3 


336152 . 


CH22_FGENES.706_9 


10.5 


336416 


CH22_FGENES.823_38 


5 


336444 


CH22 FGENES.827 10 


4.8 


336449 


CH22 FGENES.829 5 


13.6 


336471 


CH22 FGENES.829_30 


6.9 


336512 


CH22 FGENES.834_7 


21.4 


336558 


CH22 FGENES.842_3 


8.2 


336560 


CH22 FGENES.842^5 


9 


336676 


CH22 FGENES.434 


9.4 


336959 


CH22 FGENES.357-13 


19 


337968 


CH22 EM:ACO055QO.GENSCAN.103-2 


13.4 


338008 


CH22 EM:AC005500.GENSCAN.127-9 


15.2 


338057 


CH22 EM:AG005500.GENSCAN.160-1 


13.9 


338410 


CH22 EM:AC005500.GENSCAN.341-6 


8 


338451 


CH22 EM:AC005500.GENSCAN.359^ 


11.6 


338588 


CH22 EM:AC005500.GENSCAN.432-1 


10.3 


338665 


CH2^EM:AC005500.GENSCAN.464-2 


4.8 


338669 


CH22 EMAC005500.GENSCAN.475-3 


6.7 


338832 


CH2^DJ246D7.GENSCAN.6-9 


4.8 


338980 


CH22 DA5gH16.GENSCAN2-4 


5.1 


339352 


CH22 BA354l12.GENSCAN^-7 


6.9 


339373 


CH22_BA232E17.6ENSCAN.1-29 


4.3 
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TABLE 13A 

Table 13 A shows the accession niunbers for those pkeys lacking unigenelD's for Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" colmnn. 



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accession 

123619 371681J AA602964 AA609200 
103207 30635_-4 X72790 
103349 11052_-2 X89059 

110856 19346.14 AA992380 N33063 N21418 H79958 R21911 H79957 
113248 328626J T63657 AW971220 AA493469 T63699 

123169 44573_2 AI950087 N702b8 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874088 AL134043 AW235363 AA66334d AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 A1653832 
AI762688 AA988777 AA486892 AI356394 AW103813 Ai539642 AA642789 AA856975 AW505512 AI981530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961 101 AA251669 AA251874 
AI819225 AW205862 A1683338 AI858509 AW276905 A1633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75828 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 A1373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 

123533 genbanleAA608751 AA608751 

116480 genbank.C14088 C14088 

132225 genbank.AAl 28980 AA128980 

125154 genbank_W38419 W38419 ' 

118475 genbanK.N56845 N66845 

102919 25180.2 M21191 AL035748 AA021266 AA323126 AA180515 AI613029 D28356 NM_000034 M11560 AW401425 AW248248 AA012956 
AA323294 W04965 H38759 AA206622 AA580747 AI541172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 
AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 
AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA085425 AA481708 
AA313637 AL039229 H&4490 H85153 F00656 AA326668 AA347304 R65890 H41949 AA339309 AW402002 AW404854 
AA192582 AA1 12802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H8591 1 AA310414 H93436 N87014 
/ AA001186 H83640 AA411328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 AA197161 
AA379497 AA31 1816 AA017206 AA001 137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 
AW362775 AA223624 AW362B09 AW362746 AW362753 AW380412 AA057104 AA295271 XD6352 AW362771 X12447 R59553 
R87925 AA292864 R83920 F01 120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 Ai908970 N89223 AW361938 AJ94O063 AA134323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 A1074079 F291 18 AA852940 F35696 AA345953 
AA079578 AA1 13785 C02989 AA095945 C03145 C05199 AA346024 AA190506 AW361659 AI909845 AW374374 AW374382 
AW374401 AW374373 AW374370 AI909831 AW374367 AA353658 F01041 C02843 AA37594B AW374414 AA213946 AI525039 
H13744 R31007 AA112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112875 H46393 AA191267 D59131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW327311 H30301 AA300705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21 190 AA410818 AA250940 AI354547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H564B8 AA336320 AW406501 AA354102 AA382942 AA096393 
AW376830 AA383446 F01259 AW081388 T94036 AA379643 H43842 AI524063 AA621727 AA379099 AA371417 R66811 
H44129 N84794 F01 135 AA477852 AA293052 AW361595 H27194 A1831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 AI192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03180 F00946 
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C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI62S219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 At5715g6 AI128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 AA600009 A{885540 AA771884 
5 AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA4801 15 AA574051 

AI8891 85 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA41 1329 AA781570 AI8331 76 
AA935520 AW074197 AA583063 AW073099 AW001198 AA948025 AA587857 AA191540 AI460085 AA193244 A1538037 
AA515572 AA758587 AI14931 1 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 025663 
AA781985 AA284536 A1819422 H16040 H27531 AA456564 AA845555 AI423596 AA012908 AA889439 AA716311 AA968868 

1 0 AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 A1003519 AI002503 AI347597 

AI04O946 AA197162 AA987883 AA292865 AW001944 AI64071 1 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA665379 F35456 AA993537 Ai749610 AA226934 AA716204 
AW51 3025 AA628543 AA583705 F25702 A1368748 A1 1 24097 A1880086 AA47751 3 AI758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427556 F25054 AI569314 AA961665 A1922050 AI759000 AA555235 AA514432 

1 5 AA293474 AA001129 AA826789 AA641390 AA134405 F35585 AA477416 AW193359 AI361315 AA284988 F36340 AI361322 

F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA865535 AA134324 
AW192842 A1224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
-A1830942 AA1 13195 F26432 W55652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857G63 AA017259 

20 AA179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 

AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974886 F16089 F26Q54 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 A1811549 AA665180 AA708200 F01124 F32382 AA346220 AA6273S1 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F2191 1 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 

25 AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F2841 1 

H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18063 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F3.1459 F18564 F33498 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24B35 F34672 F26474 AI926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16863 

30 F34033 F21515 F173e4 F18383 F16546 F17551 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 

F34230 F33258 F20860 F17998 AI695701 
118600 genbanK_N69222 N69222 
118952 genbank^N92966 N92966 
120873 genbank_AA358015 AA358015 

35 113702 genbank_T97307 T97307 

129982 221_267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X6101 1 M34024 AA327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF052134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 

40 AF062177 AF062222 AF0621 15 Z47240 AF062263 AF062261 AF062223 AF06221 1 Z47238 AW401714 AW404008 AW404991 

AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z1 1946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174ai9 AF062285 L06924 L1 1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 Li2087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 A1268604 Y15773 X64239 X62969 U00506 

45 X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 

D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 

50 Z96957 AF021986 Z92698 Zg8956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 

AF022011 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z98955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 133037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 A'F103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 UB6801 AF103150 AJ010334 AF035027 AJ007327 AF103115 

55 AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208AF103210AF068668 AF068670 AF068671 AF068664 

AF068669 AF0B8666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW3621 53 AW403803 AW406702 
AW351514 D78345 129140 J00231 NfvL002179 AW405146 AA301091 X04646 H84660 AW402990 AW406534 T93007 
Ai857980 AW368899 AI905833 AW406586 AA482084 A1872299 AA715266 AW404328 AI831674 AI709348 AA6031 12 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T2761 3 T981 13 

60 115763 genbank_AA421560 AA421560 
124357 genbank_N22401 N22401 
108733 504187 J AA121022AA126422 
101544 entre2_M31169 M31169 
124447 genbank_N48000 N48000 

65 124677 genbank_R01073 R01073 
124777 genbank_R41933 R41933 
119302 genbanK.T25725 T25725 
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103680 entrez^Z93784Z93784 
135029 H58818^at H58818 
112253 genbanK.R51818 



R51818 



Pkey CAT number Accession 
5 322175 46877_1 AF085975 H53458 H53459 

323011 139750 1 AA580288 AA31 5655 AA1 33031 AA377748 
322975 1510563 1 C16391 C16413 

317210 21 1994_1 AW881 145 AA490718 M85637 AA304575 T06057 AA331991 

323817 233566_1 AA410943 AW948953 AA334202 AA332882 
10 309583 1046029 -2 AW170035 

324961 376239J AA613792 AW182329 T05304 AW858385 

303502 325188.1 BE174240 AA488528ALJ042253 

320133 447553 1 BE151746 BE336853 D63271 T94955 AA774994 

311935 174129 1 AA216387 T63548 AA228676 
15 321050 502195 1 C05928 AW393497 

319977 345248J AA534222 AA632632 T81234 

312772 4380.7 AW962299 AA310349 AW962294 H63791 H63751 

321354 116028.-2 AA078493 

336512 CH22 3941FG_834.7.UNK_DJ 
20 336558 CH22_3992FG_842_3.UNK.DJ 

336560 CH22.3994FG_842_5.LINK_DJ 

329367 c_x_hs 

329373 c_x^hs 

336676 CH22_4154FG_43„4_ 
25 - 338008 CH22_6490FG__LINK_EM:AC00 

338057 CH22.6558FG_UNK.EM:ACa0 

329655 c14j)2 

336959 CH22_4764FG_367.13. 

329899 c15j)2 
30 329960 c16_p2 

338410 CH22^7067FG_LINK>EM:AC00 

338451 CH22 7124FG_LINK_EM:AC00 

338588 CH22_7331FG_L!NK_EM.AC00 

338665 CH22.7438FG_LINK^EM'J\C00 
35 338689 CH22.7464FG_L1NK.EM:AC00 

308677 AI761173 

338832 CH22 7678FG_UNK_DJ246D7 
338980 CH22_7859FG_UNK.DA59H18 
333009 CH22 233FG_61J.LINieEMA 
40 333010 CH22_234FG_61.2.UNK.EWtA 
333013 CH22_237FG.61.5.LINK_EMA 
308981 AI873242 
308995 A{880172 

333108 CH22.336FG_79_14_LINK_EM: 
45 333139 CH22L368FG_83JB_UNK^EM: 

333254 CH22_495FG_118_2_LINK_EM: 

333305 CH22 550FG.137.2_UNICEM: 

333343 CH22.589FG_139.12.LiNK.EM 

333388 CH22_634FG_144_3_UNK»EM: 
50 326213 c17.hs 

333456 CH22.706FG.157.5.LINK.EM: 

333459 CH22.709FGJ57.8.LINK EM: 

333517 CH22.773FG_173.2_LINK EM: 

333585 CH2^.846FG_203.4.UNK_EM: 
55 333679 CH22_941FG_247.6.LINK_EM: 

326474 c19.hs 

333743 CH22^1009FG.264_1^L1NK_EM 
333758 CH22_1024FG.268.1^LINK^EM 
333767 CH22^1034FG^271.6.LINK_EM 
60 333768 CH22_1035FG_271„7_LINK.EM 
333769 CH22_1036FG_271.8_LINK^EM 

333795 CH22.1063FG_275.1.LINK_EM 

333796 CH22.1065FG.275.3.LINHLEM 
335004 CH22_2326FG.472_8_UNK.EM 

65 333892 CH22_1163FG_292.14.UNKJE 
335115 CH22J2447FG_495.2-UNK-EM 
333904 CH22.1176FG_294_2_UNK.EM 
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333905 CH22_1177FG.294.3.UNK_EM 

333921 CH22_1194FG_296_12_UNK^E 

333968 CH22^1245FG_307^4_LINK_EM 

328059 c_6_hs 

5 335287 CH22.2629FG.526_11_LINK^E 

326816 c20_hs 

326817 c20_hs 

335342 CH22_2689FG_536J_LINK_EM 

335491 CH22_2843FG_570_23_UNK_E 

10 335495 CH22_2847FG^570_28_LINK_E 

335498 CH22^2850FG.571_7_LINK_EM 

328304 c^7_hs 

305453 AA738110 

335544 CH22_2899FG_576_5_LINK_EM 

15 335610 CH22_2969FG_583^4^LINK^EM 

335653 CH22_3013FG 590_4_LINK^EM 

335682 CH22_3043FG.595^2,LINK_EM 

335687 CH22_3048FG.598^2_LINK_EM 

328492 c_7_hs 

20 335755 CH22_3122FG^604_4^LINK^EM 

335782 CH22J151FG_609_4^LINK_EM 

335791 CH22_3160FG_611_7„LINK_EM 

335809 CH22,3181FG_617^6_LINK_EM 

335822 CH22_3195FG_619_7_LINK_EM 

25 335823 CH22_3196FG_619_8^LINK^EM 

335824 CH22_3197FG_619J1-LINK^E 

335825 CH22_3198FG_619 12,LINK_E 
335895 CH22_3272FG_635 3_UNK_EM 
335917 CH22_3294FG_636J3_LINK_E 

30 335920 CH22_3297FG_636J6_UNK_E 

305898 AA872838 

305913 AA876109 

305950 AA884479 

328857 c_7 hs 

35 330084 c19_p2 

337968 CH22_6419FG_LINK_EM:AC00 

' 309177 AI951118 

309198 AI955915 

309226 AI969897 

40 339352 CH22_8317FG_LINK_BA354n 

309279 AI990102 

339373 CH22_8346FG_LINHLBA232E1 

325522 c14_hs 

334102 CH22^1380FG_327_60^L1NK^E 

45 332927 CH22„148FG.38„1_LINK.C20H 

332929 CH22_150FG,38_3_LINK_C20H 

332930 CH22J51FG_38 4_LlNfeC20H 
332955 CH22_179FG_48^12_LINK_EM: 
332958 CH22_182FG_48J5_LINICEM: 

50 332961 CH22.185FG.48_18_UNK^EM: 

332983 CH22.207FG_54_5.UNK_EM:A 

334222 CH2a„1506FG_360J3_LINK.EM 

334223 CH22L1507FG_360J».LINK.EM 
334264 CH22J551FG_367J5„UNK_E 

55 327110 c21_hs 

334343 CH22_1636FG_375 25^LINK_E 

334360 CH22_1654FG_378_5_UNK«EM 

327196 c_1 hs 

327283 c_1_hs 

60 327313 c_1 hs 

304465 AA421948 

304507 AA456426 

327450 c_2 hs 

304591 AA505702 

65 304601 AA507875 

304659 AA533185 

334784 CH22 2096FG_432_9_UNK„EM 
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334789 CH22^2101FG^432J4_L1NK^E 
334794 CH22^2106FGJ34J2_L1NHLEM 
336035 CH22_3420FG_678_6_LINK_DJ 
336042 CH22_3427FG_679_4_LINK.DJ 
336093 CH22„3481FG_691^2_LINieDJ 
336096 CH22_3484FG^691_5_L1NK_DJ 
334889 CH22_22a6FG^452_3_LINK_EM 
336150 CH22_3540FG^706_6.LINK_DA 
336152 CH22_3543FG_705_9_UNK.DA 
336416 CH22^3833FG_823^38.UNK^B 
336444 CH22_3B64FG_827J0_UNK_D 
336449 GH22_3870FG_829„6_UNK_DJ 
336471 CH22_3894FG_829.30_UNK^D 
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TABLE 13B 

Table 13B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 13B. For each predicted exon, we have listed the genomic 
sequence source Used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Unique number con-esponding to an Eos probeset 

Sequence source. The 7 digit numbers in this column are Genbank IdentiRer (Gl) numbers. "Dunham I. ei al." refers to the publication 

entitled The DNA sequence of human chromosome 22." Dunham i ei al.. Nature (1999) 402:489-495. 
Indicates DNA strand from which axons were predicted. 
Indicates nucleotide positions of predicted exons. 



Pkey: 
Ref: 

Strand: 
NLposition: 



Pkey Ref 



332955 
332956 
332961 
333139 
333254 
333305 
333368 
333517 
333585 
333679 
333767 
333768 
333769 
333795 
333796 
333892 
333921 
333968 
334102 
334284 
334343 
334794 
334889 
335287 
335491 
335495 
335498 
335653 
335687 
335809 
335822 
335823 
335824 
335825 
336035 
336093 
335095 
336444 
336959 
338008 
338057 
338410 
338588 
338665 
338832 
338980 
339352 



Dunham. I. et.ai. 
Dunham. I. ef.al. 
Dunham, I. etal. 
Dunham, I. et.al. 
Dunham, I. etai. 
Dunham, I. etai. 
Dunham, (. etal. 
Dunham, i. etal. 
Dunham, i. etai. 
Dunham, t. etal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, t etal. 
Dunham. I. et.al. 
Dunham, i. etal. 
Dunham, I etal. 
Dunham, I. etal. 
Dunham, I. etai. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, (. etal. 
Dunham, t etal. 
Dunham, I. etal. 
Dunham, I. etat 
Dunham, I etal. 
Dunham, I etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham, I. etat 
Dunham, t etat 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, i etat 
Dunham. 1. etal. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham, 1. etal. 
Dunham. I, etat 
Dunham. 1. etat 
Dunham. I. etal. 
Dunham, t etal. 
Dunham, 1. etal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, t etal 
Dunham, 1. etal. 
Dunham, I. etat 



strand 


Nt_position 


Plus 


2508896-2508992 


Plus 


2516164-2516310 


Plus 


2521424-2521555 


Plus 


3369495-3369571 


Plus 


2521424-2521555 


Plus 


4630388-4630645 


Plus 


4913749-4913805 


Plus 


5570729-5570925 


Plus 


6234778-6234894 


Plus 


7068795-7068896 


Plus 


7694407-7694623 


Plus 


7695440-7695697 


Pius 


7696625-7696707 


Plus 


7807688-7807795 


rlUS 


/ oOd25 3-/o0o3i 9 


Plus 


8156825-8157001 


Plus 


8380325^380441 


Plus 


8681004-8681241 


Plus 


9995140-9996373 


Plus 


13234447-13234544 


Plus 


13855828-13656307 


Plus 


16374312-16374458 


Plus 


19286024-19286515 


Plus 


22299047-22299299 


Plus 


24128651-24128827 


Plus 


24140688-24140872 


Plus 


24172082-24172161 


Plus 


25329710-25329802 


Plus 


25445952-25446064 


Plus 


26310772-26310909 


Plus 


26364087-26364196 


Plus 


26365925-26366004 


Plus 


26376860-26376942 


Plus 


26378175-26378268 


Plus 


29016748-29017410 


Plus 


29556922-29557002 


Plus 


29578878-29579047 


Plus 


34190585-34190718 


Plus 


13233040-13233126 


Plus 


7697068-7697236 


Plus 


8526397-8526522 


Plus 


19292807-19292916 


Plus 


22896767-22896920 


Plus 


24472654-24472853 


Plus 


27775128-27775290 


Plus 


29896789-29896874 


Plus 


33544784-33545121 
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332929 Dunham. I elal. Minus 

332930 Dunham, 1. elal Minus 
332983 Dunham, i. etal. Minus 

333009 Dunham. I. eLat. Minus 

333010 Dunham. 1. etal. Minus 
333013 Dunham, I etal. Minus 
333108 Dunham, tetat Minus 
333343 Dunham, I. etat Minus 
333455 Dunham, !. etal. Minus 
333459 Dunham. 1. etal. Minus 
333743 Dunham, I. etaL Minus 
333758 Dunham. I. etal Minus 

333904 Dunham, I. etat Minus 

333905 Dunham J. etat Minus 

334222 Dunham. tetat Minus 

334223 Dunham. tetat Minus 
334360 Dunham. tetat Minus 
334784 Dunham, t etal. Minus 
334789 Dunham. tetat Minus 
335004 Dunham. tetat Minus 
335115 Dunham, tetat Minus 
335342 Dunham. tetat Minus 
335544 Dunham, tetat Minus 
335610 Dunham, t etal. Minus 
335682 Dunham. tetat Minus 
335755 Dunham, tetat Minus 
335782 Dunham, tetat Minus 
335791 Dunham, tetat Minus 
335895 Dunham, tetat Minus 
335917 Dunham, tetat Minus 
335920 Dunham, tetat Minus 



336042 


Dunham, t etat 


Minus 


336150 


Dunham, t etat 


Minus 


336152 


Dunham, t etat 


Minus 


336416 


Dunham, i. etat 


Minus 


336449 


Dunham, t etat 


Minus 


336471 


Dunham, t etat 


Minus 


336512 


Dunham, t etat 


Minus 


336558 


Dunham, t etat 


Minus 


336560 Dunham. tetat 


Minus 


336676 Dunham, tetat 


Minus 


337968 Dunham. teUt 


Minus 


338451 


Dunham. I etat 


Minus 


338689 Dunham. tetat 


Minus 


339373 


Dunham, t etat 


Minus 


325622 


5867000 


Plus 


329655 


6448516 


Minus 


329899 


6563505 


Minus 


329960 


5091594 


Minus 


326213 


5867224 


Minus 


326474 


5867405 


Plus 


330084 


6015302 


Minus 


326816 


6552458 


Plus 


326817 


6552458 


Plus 


327110 


6117842 


Plus 


327195 


5867446 


Plus 


327283 


5867478 


Minus 


327313 


5867501 


Minus 


327450 


5867766 


Minus 


328059 


6117819 


Plus 


328492 


5868455 


Minus 


328304 


6004478 


Minus 


328857 


6381927 


. Minus 


329367 


5868842 


Minus 


329373 6682537 


Minus 



2020758-2020864 

2022565-2022497 

2631933-2631797 

2766043-2765856 

2766207-27661 IS 

2772278-2772039 

3240494-3240389 

4692886^692753 

2631933-2631797 

5144548-5144344 

7573218-7573060 

7666413-7666091 

8217374-8217261 

8217796^217670 

12732417-12732289 

12734365-12734269 

13728850-13728751 

16294548-16294360 

16306095-16305996 

20581911-20581794 

21388250-21388146 

22597448-22597284 

24650505-24650403 

25068943-25068841 

25421215-25421093 

25763806-25763747 

25908578-25908440 

25948563-25948411 

26975307-26975239 

27028481-27028377 

27034927-27034811 

29041694-29041500 

30150423-30150256 

30156053-30155870 

34047408-34047311 

34204707-34204577 

34215091-34214978 

34278373-34278275 

34375825-34375698 

34376814-34376596 

2022565-2022497 

7095797-7095680. 

20174286-20174193 

24893073-24892972 

33860127-33860047 

69994-70075 

35565-35843 

111058-111783 

1031-1162 

60751-60927 

16995-18101 

57019-59337 

198354-198436 

199909-200001 

94608-94785 

180921-181333 

567-962 

89734-89838 

47928-48076 

37052-37204 

46094-46241 

3884^952 

80557-81051 

87201-87587 

38950-39301 
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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 





ExAccn 


unioene lu 


unigene. 1 lue 


Kl 


lUUvJO 






Ar r A COnUOi. 0 1 A 1 1 


ID./ 


mm Id 


UUU99D 


He nOQRO 


thy midy late synthetase 




innQTR 




nS.r 09UD 


lymphocyte Qftosolic protein 1 (L-plastin) 


OU.1 


ll/IUOl 




ns. 101/00 


mabix metaltoprot^nase 9 (gelafinase B; 9 


O/wC 


1U11U4 


LUruiO 


nS.i09zoo 


Human neuropepude Y receptor Yi (NPYY 


10.3 




Libido 


ns.9UU9j 


neat snocK /OKD protein 4 


17:4 


101339 


L4707fi 


no. 1 uuO^O 


norno sapiens \ceii line nL*0| aipna lopois 








ns.ooo 


tnterferon-stlmulated protein; 15 {^a 


lo.l 


lU I0U9 


iVlOOO*fiJ 


ns. 0^0/ 


nomo sapiens connextn (ojuz/ mnNA 








ns,oiu/i 


extracellular matrix protein 1 








Uc- -1 -f occc 

nS.nijDob 


nunnan clone zSJoB mKNA; partial cos 


15 


lUZOl# 


uyuyiw 


Ue> Q'37'l/( 


Human clone 23773 mRNA sequence 




109007 


AUoyoD 




heme oxygenase (decycling) 1 


OO 7 




Al /0h4 


Ue 07r47 


bi to o pnase transition 1 




IUjUOU 


AOf /DO 




matrix metalloproteinase 1 1 (stromelysin 3 


'47 Q 
1 f.O 


lUJ lOU 




Uc *i'577 


isocitrate dehydrogenase 2 (NADP+); mito 




103206 


X72755 


Hs 77367 


mnnnktnp inHtinpH Kv nAinms intprfpmn 


15.1 


103821 


AA1 57623 


Hs.198793 


KtAA0750 gene product 


23^3 


104115 


AA428090 


Hs.26102 


ESTs 


287 


104667 


AA007234 


Hs.30098 


ESTs 


16.6 


105186 


AA191512 


Hs-28005 


Homo sapiens mRNA; cDNA DKF2p564G 


19.3 


106103 


AA421104 


HS.12G94 


ESTs 


15.4 


107151 


AA621169 


Hs.8687 


ESTs 


19 


109415 


AA227219 


Hs.1 10826 


trinudeotide repeat containing 9 


20.1 


110189 


K20543 


Hs.6278 


OKFZP586B1621 protein 


16.6 


110561 


H59617 


Hs.5199 


ESTs; Weakly simOar to UBIQUITIN-CON 


19.5 


110734 


H98714 


Hs.24131 


ESTs 


30.2 


110915 


N46252 


Hs.29724 


ESTs 


23.2 


111179 


N67239 


Hs.10760 


ESTs 


37 


111357 


N91023 


Hs.87128 


ESTs 


15 


112134 


R46025 


Hs.7413 


ESTs 


•17.4 


113970 


W86748 


Hs.8109 


ESTs 


15 


114124 


Z38595 


Hs.125019 


ESTs; Highly similar to KIAA0886 protein 


22 


114292 


Z40715 


Hs.184641 


deita-6 fatty acid desaturase 


19.4 


114901 


AA236276 


Hs, 195437 


ESTs; Weakly similar to R26660_1; partial 


16.9 


114965 


AA250737 


Hs.72472 


ESTs 


35.1 


115652 


AA405098 


Hs.38178 


ESTs 


16.1 


115875 


AA433943 


Hs.43946 


ESTs; Weakly similar to Weak similarity t 


33:5 


116790 


H29532 


Hs.101174 


microtubule-assodated protein tau 


22,2 


116921 


H72948 


Hs.821 


biglycan 


20.7 


117412 


N26722 


Hs.42645 


ESTs 


18.1 


120241 


241815 


Hs.65946 


ESTs 


15.6 


120325 


AA195651 


Hs.104106 


ESTs 


15.2 


121598 


AA416740 


Hs.174104 


ESTs 


22.6 


123619 


AA609200 




ESTs 


23.1 


124006 


D60302 


Hs.270016 


ESTs 


20.6 


125852 


H09290 


Hs.76550 


Homo sapiens mRNA; cDNA DKFZp564B 


25.9 


126160 


Ng0960 


Hs.265398 


ESTs: Weakly simOar to transfonmation-rel 


16.4 



252 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



127677 AA916752 


Hs.264190 


ESTs; Highly similar to MEM3 (M.muscul 


17.3 


128595 U31875 


Hs. 152677 


short-chain alcohol dehydrogenase family m 


27.1 


128717 T30617 


Hs.l 04222 


Homo sapiens mRNA; cDNA DKFZp566L 


24.5 


129124 AA234530 


Hs.108802 


N-ethylmaleimide-sensitive factor 


20.7 


129366 H18027 


Hs.184697 


plexin CI 


18.2 


130455 X17059 


Hs.155956 


N-acetyltransferase 1 (arylamine N-acetylt 


26.4 


130604 X03635 


Hs.1657 


estrogen receptor 1 


39.9 


130913 W03592 


Hs.21198 


translocase of outer mitochondrial membra 


20.9 


130944 M97935 


Hs.21486 


signal transducer and activator of transcript 


18.8 


131472 AA608962 


Hs.27258 


calcyclin binding protein 


18.1 


131562 U90551 


Hs.28777 


H2A htetone family; member L 


18.8 


132180 AA405569 


Hs.418 


fibroblast acGvation protein; alpha; seprase 


15.4 


132406 F0g979 


Hs.4774 


ESTs 


15 


132465 AA0478g5 


Hs.49169 


ESTs 


15.4 


132994 AA505133 


Hs.279905 


solute carrier ^mlly 2 (facilitated glucose t 


26.4 


133294 R79723 


Hs.69997 


zinc finger protein 238 


30.4 


133634 U24166 


Hs.234279 


micFotubule-associated protein; RP/EB fam 


15.2 


134374 D62633 


Hs.8236 


ESTs 


15.2 


134405 J04177 


Hs.82772 


collagen; type XI; alpha 1 


15.3 


134470 X54942 


Hs-83758 


CDC28 protein kinase 2 


20.3 


134495 D63477 


Hs.84087 


K)AA0143 protein 


16.1 


134714 U89922 


Hs.890 


lymphotoxtn beta (TNF superfamily; memb 


35.7 


135237 AA454930 


Hs.9691 


ESTs 


19.5 


301884 AA312082 


Hs.105445 


GDNF family receptor alpha 1 


20.7 


302276 NM^004448Hs.323910 


EST cluster (not in UniGene) with exon hit 


21.6 


302290 AL1 17607 


Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N 


41.4 


309177 AI951118 




EST singleton (not in UniGene) with exon 


24,3 


309583 AW170035 




EST 


64.5 


310438 AW022192 Hs.200197 


ESTs 


39.1 


311166 AI821294 


Hs.l 18599 


ESTs 


24.1 


312153 AA7oS25> 


Hs.1 53028 


cytochrome b-561 


27.1 


313915 AI9693S0 


HS.163443 


ESTs 


27.1 


314506 AA833655 


Hs.206868 


ESTs 


27.8 


314558 AI873274 


Hs.190721 


ESTs 


22.5 


314691 AW207206 Hs.136319 


ESTs 


21,4 


314943 AI476797 


Hs.184572 


cell division cycle 2; G1 to S and G2 to M 


18.4 


315196 AA972756 


Hs.44898 


ESTs 


28,8 


316177 AI908272 


Hs.293102 


EST cluster (not in UniGene) 


32.6 


318073 AW167087 Hs.131562 


ESTs 


15.7 


318662 A1285898 


Hs.294014 


ESTs - 


16.3 


318740 NM_002543Hs.77729 


EST cluster (not in UniGene) 


21.3 


318744 AI793124 


Hs.144479 


ESTs 


35 


319668 NM_G02731Hs.87773 


EST cluster (not in UniGene) 


25.4 


320074 AA321166 


Hs.278233 


EST cluster (not in UniGene) 


16.7 


320211 AL039402 


Hs.125783 


DEME-6 protein 


24.3 


320727 U96044 


Hs.181125 


EST cluster (not in UniGene) 


15.3 


322818 AW043782 Hs.293616 


ESTs 


21 


322882 AW248508 Hs.279727 


DiGeorge syndrome critical region gene 2 


15.3 


324261 AL044891 


Hs.269350 


EST cluster (not in UniGene) 


50.1 


324432 AA464510 


Hs.1 52812 


EST cluster (not in UniGene) 


16.7 


324603 AW016378 Hs,292934 


ESTs 


23.1 


324620 AA448021 


Hs.94109 


EST cluster (not in UniGene) 


21.2 


324988 T06997 


Hs.121028 


EST cluster (not in UniGene) 


24.5 


330388 X03363 




HER2 receptor tyrosine kinase (c-erbB-2; E 


17.7 


330486 M13755 


Hs.833 


Interferon-stimulated protein; 15 kDa 


67 


330814 AA015730 


Hs.265398 


ESTs; Weakly similar to transformation-rel 


44.1 


331145 R72427 


Hs.129873 


ESTs; Weakly similar to CYTOCHROME 


41.9 


331306 AA252079 


Hs.63931 


dachshund (Drosophila) homolog 


15.1 


331890 -AA432166 


Hs.3577 


succinate dehydrogenase complex; subunit 


24.3 


332526 AA281753 


Hs.77515 


inositol 1;4;5-triphosphale receptor, type 3 


. 19 


332532 N63192 


Hs.l 892 


EST; Highly similar to PHENYLETHANO 


15.3 


332694 AA262768 


Hs.243901 


KIAA1067 protein 


15.2 


332958 




CH22-FGENES.48_15 


17.8 


333769 




CH2^.FGENES.27U 


483 


333968 




CH22.FGENES.307_4 


15.9 


334223 




CH2Z.FGENES.360^4 


335 


334264 . 




CH2^.FGENES.367J5 


185 
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335791 CH22.FGENES.611_7 27.3 

336512 CH22.FGENES.834_7 21.4 

338008 CH22_EMAC005500.GENSCAN.127-9 15.2 
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TABLE 14A 



Table 14A shows the accession numbers for those pkeys lacking unigenelD's for Table 14. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and AUgnment Tools (DoubleTwist, Oakland CaUfomia). The Genbank accession nimabers 
for sequences comprising each cluster are listed in the "Accession" colimm. 

10 

Pkey: Unique Eos probeset identifler number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

309583 1046029_-2 AW170035 

20 336512 CH22_3941FG_834_7^UNK.DJ 

338008 CH22^6490FG_LINK_EM:AC00 

333769 CH22_1036FG„271_8_LINK_EM 

333968 CH22_1245FG^307_4_LINK_EM 

335791 CH22_3160FG..611_7_UNK^EM 

25 309177 AI951118 

332958 CH2^„182FG_48_15_L1NK„EM: 

334223 CH22_1507FG_360_4_UNK_EM 

334264 CH22.1551FG_367^15_LiNK_E 

123619 371681.1 AA602g64 AA609200 
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TABLE 14B 



10 



15 



20 



25 



Table 14B shows the genomic positioning for those pkeys lacking unigene DD's and 
accession numbers in Table 14. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



Pkey; 
Ref: 

Strand: 
NLposition: 



Pkey Ref 



Unique number corresponding to an Eos probesel 

Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gi) numbers. "Dunham 1. et al." refers to the publication 

entitled "The DMA sequence of human chromosonfie 22." Dunham 1. et al,. Nature (1999) 402:489-495. 
Indicates DMA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 



Strand NtjKisitlon 



332958 
333769 
333968 
334264 
338008 
334223 
335791 
336512 



Dunham. 
Dunham, 
Dunham, 
Dunham. 
Dunham. 
Dunham, 
Dunham. 
Dunham. 



I. etal. 
I. etal. 
!. et.al. 
f. etat. 
I. etal. 
I. etal. 
Letat 
tetat 



Plus 

Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 



2516164-2516310 

7696625-7696707 

8681004-8681241 

13234447-13234544 

7697068-7697236 

12734365-12734269 

25948563-25948411 

34278373-34278275 
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TABLE 15: Table 3 from BRCA 001-5 US 



Table 15 shows genes downregulated in breast cancer cells. 



10 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Untgene number 

Unigene Title: Unigene gene title 

R1: Ratio of normal breast tissue to. tumor 



Pkey ExAccn 



UnlGene IDUnigene Title 



R1 

15 

100115 D00632 Hs-172153 glutathione peroxidase 3 (plasma) 1.7 

100499 TIGR:HT1428 Hs.283108 Globin.Beta 1.5 

100502 T1GR:HT1496 Hs,169228 Adrenal-Spedfic Protein Pg2 2.3 

100815 T)GRJfr4268 Hs.9739 L-GlyceroW-Phosphate:Nad+Qxidoreduct 1.7 

20 101125 L10373 Hs.82749 transmembrane 4 superfamily ntember 2 1.5 

101367 M12983 Hs.4 akx)ho) <iehydrogenase 1 (class I); alpha po 2.9 

101397 M15856 Hs.1d0878 lipoprotein lipase 1.6 

101883 Mg8399 Hs:75613 CD36 antigen (collagen type I receptor, ttir 1.6 

102227 U25138 Hs.93841 potassium large conductance caldum^ctiv 1.6 

25 102857 X00129 Hs.76461 retinoWmding protein 4; inters&fial 3 

103211 X73079 Hs.288579 polymeric Immunoglobulin receptor 1.8 

103496 Y09267 Hs.1 32821 flavin containing monooxygenase 2 1.5 

103562 Z21966 Hs.2815 POU domain; dass 6; transcripl/on factor 1 1.8 

104672 AA007629 g!ycerol-3-phosphate dehydrogenase 1 (sol 2.4 

30 105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7 

105138 AA164519 Hs:i5248 ESTs 1.5 

106075 AA417915 Hs.25930 ESTs 1.5 

106870 AA487576 Hs.26530 serum deprivatton response (phosphatidyls 1.6 

107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7 

35 107616 AA004901 Hs,261164 ESTs 1.6 

107997 AA037388 Hs,82223 Human DMA sequence from done 141H5 o 1.7 

108604 AA099820 Hs.49696 ESTs 2.4 

111130 N64265 Hs.l9515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7 

111837 R36447 Hs.24453 ESTs 1.6 

40 112538 R70255 ESTs 1.9 

112808 R97970 Hs.281022 EST . . 1.5 

113086 T40652 Hs.209100 DKFZP434C171 protein 1.9 

115740 AA4 18033 Hs.283559 ESTs 1.6 

115949 AA443800 Hs.43125 ESTs 2 

45 115965 AA446661 Hs.173233 ESTs 2.2 

117224 N20300 Hs218707 ESTs 1.7 

117513 N32174 Hs.44317 SRY (sexKletermining region Y)-box 10 1.7 

119069 R15436 Hs.77889 Fnedreich ataxia region gene X123 1.7 

119175 R71792 Hs.301002 ESTs; Weakly similar to oeH death activato 2.8 

50 119359 T71021 . Hs.285681 ESTs; Highly similar to WSbaslc-helix-loo 1.9 

119798 W73386 Hs.249129 ESTs 3 

120889 AA365784 Hs.g7044 ESTs 16 

121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8 

121750 AA421184 Hs.97549 ESTs 1.5 

55 122127 AA434447 Hs.108771 ESTs 2.5 

122348 AA443695 Hs.293410 ESTs 2.1 

122485 AA448300 Hs.160318 phosphoiemman .1.5 

123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylatecy 1.8 

123505 AA600135 ESTs; Moderately similar toll!! ALU SUB 1.5 

60 125284 W94688 Hs.103253 perilipin 1.7 

126300 D81972 HUM427D08B Human fetal brain (TFujiw 1.8 

126747 R72515 Hs.160318 phosphoiemman 1.6 

127218 AA309765 Hs.1 16017 ESTs; WeaWy similar to KIAA0795 protei 1.5 

127357 AA452788 Hs.75432 zx39g11.r1 Soan9SjotalJetus_Nb2HF8_9 1.7 
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127638 AA634405 Hs.122608 ESTs 1.5 

128213 AA972780 Hs.129194 ESTs; WeaWy similar to 11!! ALU SUBFA 1.5 

128351 AI092391 Hs.134886 ESTs 1.5 

128842 N44757 Hs.20340 ESTs 1.6 

5 128870 R71403 Hs75309 eukafyotic translation elongation factor 2 1.7 

129146 AA459944 Hs.108924 DKFZP586P1422 protein 1.5 

129285 T62068 Hs.11006 ESTs 2.1 

129331 N93465 Hs.279772 ESTs; Highly similar to CGI-38 protein [H 1.5 

130085 M62402 Hs.274313 insulin-Iil<e grovs^ factor binding protein 6 1.7 

10 130400 M25079 Hs.2B3108 hemoglobin; beta 1.7 

131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8 

131277 AA131466 Hs.23767 ESTs 1.9 

1 3 1 282 M1 2272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 2.2 

131304 AA295848 Hs,25475 aquaporin7 17 

15 131810 D49487 Hs.194236 leptin (murine obesity homolog) 2.5 

1 32788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1 .6 

132931 Z41452 Hs.6090 deleted in bladder cancer chromosome regj 1.5 

133120 X64559 , Hs.65424 tetranectin (plasminogen-binding protein) 2 

133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 1.5 

20 133507 X74295 Hs.74369 integrin; alpha 7 1.7 

133601 S95936 Hs.284176 transfemn 2.3 

133702 N568g8 Hs75652 glutathione S-transferase M5 1.9 

134111 N79674 Hs.8022 TU3A protein 4.6 

134699 U56814 Hs.88646 deoxyribonudease l^ke 3 1.5 

25 134749 L10955 Hs.89485 carbonic anhydrase IV 1.6 

135173 M72885 Hs.95910 Human G0S2 protein gene; complete ods 1.9 

300132 AW027556 Hs,156286 ESTs 1.7 

300732 AI369956 Hs.257891 ESTs 1,5 

300750 AA514805 Hs.293055 ESTs 1.8 

30 301140 AI807692 Hs.129129 ESTs 1.6 

301396 AA923549 Hs.224121 ESTs 2.1 

302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8 

303798 V00505 Hs.36977 hemoglobin; delta 1.6 

303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7 

35 303844 U94362 Hs.58589 glycogenin 2 1.5 

304182 H91086 EST singleton (not in UniGene) with exon 1.5 

304622 AA516384 EST singleton (not In UniGene) with exon 1.5 

304682 AA550994 EST singleton (not in UniGene) with exon 1.7 

305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5 

40 306193 AA923457 EST singleton (not in UniGene) with exon 1.5 

307206 All 92534 EST singleton (not in UniGene) with exon 1 .6 

307377 AI222691 EST singleton (not in UniGene) with exon 1.5 

308023 AI452732 Hs.251577 EST singleton (not in UniGene) with axon 1.9 

308359 AI612774 Hs.79372 retinoid X receptor; beta 1.5 

45 309838 AW296073 Hs.255504 EST 1.5 

310403 A172097d Hs.148006 ESTs; Moderately similar to aftematrvelys 1.8 

311671 AW241947 Hs.232478 ESTs 1.6 

311794 AW238092 Hs.254759 ESTs 2.1. 

312082 T79860 Hs.1 18180 ESTs 1.9 

50 312575 H25237 Hs.306814 ESTs 2.3 

313076 N49684 Hs.143040 ESTs 1,8 

313283 W32480 Hs,157099 ESTs 2.2 

313374 AW328672 Hs.132760 ESTs 1.9 

314701 AI754634 Hs.131987 ESTs 17 

55 315391 AA759098 Hs.192007 ESTs 1.8 

315688 AA680055 Hs.264885 ESTs 1.5 

316249 AA948612 Hs.130414 ESTs 1.6 

316586 AI205077 Hs.294085 ESTs 17 

316890 AA837079 Hs.24647 ESTs 1.5 

60 316983 AI480204 Hs.177131 ESTs 1.5 

317604 AI650625 Hs.300756 ESTs 1.6 

317951 AW206520 Hs.129621 ESTs 1.5 

319400 W26902 Hs.154085 ESTs 17 

320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5 

65. 321594 AA021402 Hs.1 1067 ESTs 1,7 

322102 H45589 EST duster (not in UniGene) 1.5 

322814 AI824495 Hs.211038 ESTs 2.2 
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322929 AI365585 Hs.146246 ESTs 2.3 

323831 AA335715 Hs.200299 ESTs .1.7 

324044 AL045752 Hs.22350 ESTs 1.8 

324675 AW014734 Hs. 157989 ESTs 2.2 

5 325272 CH.11_hsgi|5866902 1.5 

325558 CH.12_hsgil6056302 1.6 

325656 CH.14.hs gi|6056305 1 .6 

326120 CH.17^hsgl|5867194 1.5 

326139 CH.17.hsgil5867203 1.5 

10 326855 CH.20.hsgil6552460 1.5 

327438 CH.02.hsgil6004454 . 1.6 

329733 CH.14j)2gi|6065783 1.6 

330931 F01443 Hs.284256 ESTs ^ 4.6 

331591 N71677 Hs.42146 ESTs 1.9 

15 332159 AA621393 Hs.1 12984 EST 1.5 

332364 W94688 Hs.103253 periiipin 2.1 

332502 H21819 . Hs.14896 Homo sapiens clone 24590 znRNA sequenc 1.5 

334175 CH22„FGENES.349^10 1.5 

334347 CH22_FGENES.375^31 1.8 

20 334737 CH22.FGENES.424J2 1.8 

335352 CH22.FGENES.539_5 1.5 

335639 CH22_FGENES.584J9 1.6 

336244 CH22_FGENES.746^ 1.5 

336336 CH22_FGENES.814_8 1.7 

25 • 336865 CH22^F6ENES.305-1 1.6 

337494 CH2^_FGENES.799-12 1.6 

337764 CH22_EM:AG00a097.GENSCAN.119-1 1.8 

337983 CH22_EM:AC005500.GENSCAN.110-1 2 

338192 ' CH22_EMAC005500.GENSCAN.228-1 1.5 

30 339366 CH22.BA354I12.GENSCAN.34-2 1.5 
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TABLE 15A 



Table ISA shows the accession numbers for those pkeys lacking unigenelD's for Table 15. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the '^Accession" column. 

10 

Pkey. Unique Eos probeset identifier number 

CAT number Gene dusternumber 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

126300 25D375_2 D81972 BE003132 

20 112538 504579_1 AA908813 R70255 

123505 genbank_AA600135 AA6G0135 - 

104672 6735.7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263 
A1276281 R48205 AI245302 AI19Q036 A1281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375 
H45809 f33AA7 AA774528 AA007629 H42537 001077 F32386 

25 322102 46708.1 H45589 H19807 AF075038 H19808 H42437 

336865 CH22_4590FG_3O5J_ 

338192 CH22_6755FG_UNK.EM:AC00 

329733 c14j)2 

326120 c17_hs 

30 326139 c17_hs 

326855 c20_hs 

335352 CH22_2699FG_539.5_LINK EM 

335639 CH22_2999FG_584_19_LINK E 

307206 AI192534 

35 307377 AI222691 

337494 CH22_5727FG_799 12. 

337764 CH22_6115FG_LINK_EMAC00 

337983 CH22_6438FG_LINK^EMAC00 

339366 CH22_8336FG_LINK BA354I1 

40 325272 cll^hs 

325558 c12_hs 

325656 c14_hs 

334175 CH22_1455FG^349_10J.INK_E 

304182 H91086 

. 45 334347 CH22J640FG_375_31_LINK E 

327438 c_2^hs 

304622 AA516384 

334737 CH22_2049F6_424_12_UNieE 

304682 AA550994 

50 336244 CH22_3542FG 746_2_LINI<LOA 

306193 AA923457 

336336 CH22_3746FG_814_8 UNK.BA 
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TABLE 15B 



Table 15B shows the genomic positioning for those pkeys lacking unigene ED's and 
accession numbers in Table 15. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
Usted. 

Pkey: Unique number corresponding to an Eos probesel 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (G1) numbers. "Duiiham t. et ai." refers to the publication 

entitled The DMA sequence of human chromosome 22: Dunham I. et al.. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntj)OSition: tndk:ates nucleotide positions of predkited exons. 



Pkey Ref 



Strand NLposition 



334347 
334737 
335639 
337494 
334175 
335352 
335244 
33533e 
336865 
337764 
337983 
338192 
339366 
325272 
325558 
325656 
329733 
326120 
326139 
326855 
327438 



Dunham, i. elal. 
Dunham, I. elal. 
Dunham, I. etai. 
Dunham, I. etal. 
Dunham, I. etal. 
Dunham. I. etal. 
Dunham, I. etai. 
Dunham, I. etal. 
Dunham, 1. eta). 
Dunham. I. etal 
Dunham. L etal. 
Dunham, 1. etal. 
Dunham, I etal. 
5866902 
6056302 
6056305 
6065783 
5867194 
5867203 
6552460 
6004454 



Plus 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Minus 

Plus 

Minus 

Plus 

Plus 

Minus 

Minus 

Minus 



13663814-13663926 

15998517-15998685 

25173591-25173696 

33339024-33339148 

11668659-11668597 

22681512-22681384 

31402729-31402583 

33797209-33797076 

8622405-8622289 

4035640^035446 

7275495-7275271 

13248453-13248277 

33647431-33647293 

13247-13312 

70930-71030 

78190-78707 

163237-163450 • 

36116^6276 

218901-218960 

111390-111463 

199569-199692 
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TABLE 16: Table 4 from BRCA 001-5 US 

Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Pkey: Unique Eos probeset identifier number 

. ExAccn: Exemplar Accession number, Genbank acoesston number 

10 UnigenelD: Unigene number 

Un^ene Title: Unigene gene title 

R1:. Ratio ofnomral breast tissue to fijmor 



15 



20 



25 



30 



35 



40 



45 



Pkey 


ExAccn 


UniGene (D 


Unigene TWe 


R1 


100502 


TIGR:HT1496 


Hs.169228 


Adrenal-Specific Protein Pg2 


2.3 


101367 


M12963 


Hs.4 


alcohol dehydrogenase 1 (dass 1); alpha 


2.9 


102857 


X00129 


Hs.76461 


retinol-binding protein 4; interstitial 


3 


104672 


AA007629 




glyceroI-3-phosphate dehydrogenase 1 


2,4 


107099 


AA609645 


Hs.211568 


eukaryotic translation initiation factor 4 gam 


2.7 


108604 


AA099820 


Hs.49696 


ESTs 


2.4 


115949 


AA443800 


Hs,43125 


ESTs 


2 


115965 


AA446661 


Hs,173233 


ESTs 


2.2 


119175 


R71792 


Hs.301002 


ESTs; Weakly similar to cell deaUi activator 


2.8 


119798 


W73386 


Hs.249129 


ESTs 


3 


122127 


AA434447 


HS.1G6771 


ESTs 


2.5 


122348 


AA443695 


Hs.293410 


ESTs . 


2.1 


129285 


T62058 


Hs.11006 


ESTs 


2.1 


131267 


AA211776 


Hs^504 


myomesin 1 (skelemin) (185kD) 


3.8 


131282 


M12272 


Hs.4 


alcohol dehydrogenase 3 (class 1); gamma 


2.2 


131810 


D49487 


Hs.194236 


leptin (murine obesity homolog) 


2,5 


133120 


X64559 


Hs.65424 


tetranectin (plasminogen-binding protein) 


2 


133601 


S95936 


Hs.284176 


transferrin . 


2.3 


134111 


N79674 


Hs.8022 


TU3A protein 


4.6 


301396 


AA923549 


Hs.224121 


ESTs 


2.1 


311794 


AW238092 


Hs.254759 


ESTs 


2.1 


312575 


H25237 


Hs,306814 


ESTs 


2.3 


313283 


W32480 


Hs. 157099 


ESTs 


2.2 


322814 


AI824495 


Hs.211038 


ESTs 


2.2 


322929 


AI365585 


Hs.146246 


ESTs 


2.3 


324675 


AW014734 


Hs.157969 


ESTs 


2.2 


330931 


F01443 • 


Hs.284256 


ESTs 


4.6 


332364 


W94688 


Hs.103253 . 


perilipin 


2.1 


337983 
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TABLE 16A 

Table 16A shows the accession numbers for those pkeys lacking unigenelD's for Table 16. 
5 For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled usuig sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

15 

Pkey CAT number Accession 

104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421465 AI300352 H43971 A1378525 F33552 R47898 AI264177 F22289 N2a263 
A1276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451436 F19647 F22375 
20 H45809F33447AA774528AA007629H42537C01077 F32386 
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TABLE 17: Table 1 from BRCA 014 P 

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or 
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene 
identified, a ratio of the relative amoimt of expression in breast tumors versus normal breast 
tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to normal breast tissue 



Pkey ExAccn UnigenelO Unigene Title R1 

100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat 3 

100405 AW291587 Hs.82733 nidogen2 3.2 

100406 A(962060 Hs.118397 AE-binding protein 1 3.6 
100420 D86983 Hs. 11 8893 Melanoma associated gene 3.2 

.100911 X83300 Hs.289103.SMA4 5.2 

100960 J00124 Hs.117729 keratin 14 (epidermolysis bullosa simple 4.3 

101011 BE387036 Hs.1211 acid phosphatase 5. tartrate resistant 3 

101183 AA442324 Hs.795 H2A histone family, member O 3.2 

101194 L20971 Hs.188 phosphodiesterase 4B. cAMP-specific (dun 3 

101329 U66042 Hs.82171 Homo sapiens done 191B7 placenta expres 4.1 

101378 865^3085 Hs.833 interferon-stimulated protein. 15 kDa 5.3 

101474 R07566 Hs.73817 smalllnducible cytokine A3 (homologous 3.9 

101491 M25809 Hs.64173 ATPase, H-f- transportrng, lysosomal (vacu 4.5 

101530 M29874 Hs.1360 cytochrome P450, subfamily 118 (phenobar 9 

101602 AA353776 Hs.901 CD48 antigen (B-cell membrane prote'm) 3.4 

101663 NM_003528 Hs.2178 H2B histone family, member Q 5.6 

101758 BE019494 Hs.79217 pyrroIine-5-carboxylate reductase 1 3.6 

101767 M81057 Hs. 180884 carboxypeptidase B1 (tissue) 12 

101817 M89907 Hs.152292 SWl/SNF related, matrix associated, acli 3.2 

101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor 4.1 

101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2 6.5 

102209 NM_002038 Hs.265827 Interferon, alpha-inducible protein (do , 3 

102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11 3 

102297 NM:.001504 Hs.198252 G protein^upled receptor 9 3,7 

102299 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein 3.7. 

102301 NM_005651 Hs.183671 tryptophan 2.3-dipxygenase 5,2 

102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 3,5 

102369 U39840 Hs.299867 hepatocyte nuclear factor 3. alpha 3.9 

102591 U62325 Hs.324125 amyloki beta (A4) precursor protein-bind 4 

102721 H16646 Hs.118666 hypothetical protein PP591 3.5 

102739 AA363025 Hs.155572 Human done 23801 mRNA sequence 3.2 

102791 AF080229 gb:Human endogenous retrovirus K clone 1 3 

102804 NMJ)02318 Hs.83354 lysyl oxidase^'ke 2 3.2 

102903 M73779 Hs.250505 retinotc add receptor, alpha 3.3 

103010 X52509 Hs.l61640 tyrosine aminotransferase 12.4 

103042 T81656 Hs.252259 nlrasomat protein S3 4.5 

103117 X63578 Hs.295449 paralbumin 3 

103207 X72790 gb:Human endogenous retrovirus mRNA for 5,9 

103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.9 

103284 AI751601 Hs.8375 TNF receptor-associated factor 4 3.3 . 

103329 X85134 Hs.7^84 retinoblastoma-binding protein 5 3.1 

103364 X90872 Hs.279929 gp25L2 protein 3 

103385 NM_007069 Hs.37189 similar to rat HREV107 3.4 

103456 AA496425 Hs.9629 papillary renal cell cardnoma (transloc 3.2 
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103498 Y09306 Hs.30146 homeodomairUnteiacting protein kinase 3 3.4 

103558 BE616547 Hs.2785 keratin 17 3.7 

103563 L0291 1 Hs.150402 Activin A receptor, type ! (ACVR1) (ALK 3.2 

103612 BE336654 Hs.70937 H3 histone family, member A 4.5 

5 103825 AI571835 Hs.55468 ESTs 4 

104073 AW779318 Hs.88417 ESTs 3.8 

104103 AW021102 Hs.21509 ESTs 4.3 

104115 AF183810 Hs.26102 opposite strand to trichottiinophalangeal 7.6 

.104168 AA461618 Hs.31704 . ESTs. WeaWy similar to KIAA0227 lH.sapi 3.6 

10 104173 AAG84273 Hs.76561 ESTs. Weakly similar to S47072 finger pr 4 

104181 AF173296 Hs583740 D06 protein 3 

104189 AB040927 Hs.301804 KIAA1 494 protein 3.2 

104269 AI55S444 Hs.293960 ESTs 4.3 

104307 AI929700 Hs.1 11680 endosulfine alpha 3.1 

IS 104518 H20B16 Hs.112423 Homo sapiens mRNA; cDNA DKFZp58611420 (f 3.2 

104556 AV650851 Hs.96900 hypothetical protein; KiAA1830 protein 4.4 

104658 AA360954 Hs.27268 Homo sapiens cDNA: FU21 933 fis, clone H 3.2 

104748 AA015879 Hs,33536 ESTs 3.2 

104755 T49951 Hs.g029 OKFZP434G032 protein ' 4.5 

20 104825 AA035613 Hs.141883 ESTs 6.9 

104830 AW294092 Hs.21594 hypothetical protein IVIGCI 5754 11.1 

104865 T79340 Hs.22575 B-cefl CUTIymphoma 6, member B (zinc fi 3.5 

104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5 

104961 H78517 Hs.33905 ESTs 3.6 

25 105038 AW503733 Hs.9414 KIAA1488 protein 4.5 

105088 H58589 Hs.35156 Homo sapiens cDNA FU 11 027 fis, done PL 3.8 

105092 AA1 48982 Hs.29068 ESTs 3 

105093 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8 
105304 AW134924 Hs.190325 ESTs 8.2 

30 105397 AA814807 Hs.7395 hypothetical protein FLJ23 182 3.1 

105409 AW505076 Hs.301855 DiGeorge syndrome critical region gene 8 4.2 

105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1421 4.4 

105552 AA256750 Hs.28802 centaurin-alpha 2 protein . 3.2 

105598 AA279439 Hs.279763 hypothetical protein FU1 0504 3.5 

35 105650 W16741 Hs.25635 HSPC003 protein 3.7 

105686 AI299139 Hs.17517 ESTs 5.5 

105808 AI133161 Hs.286131 CGM01 protein 3.5 

105809 AW973653 Hs^0104 hypothetical protein FU00052 3.3 
105909 AA195191 Hs.5111 hypothetical protein FU20729 3.2 

40 105965 AA131657 Hs.23830 ESTs 3.3 

105135 AL117474 Hs,41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3.2 

106184 W28948 Hs.10762 ESTs 3.3 

106293 N39842 Hs.301444 klAA1673 4.1 

106400 BE397649 Hs.94109 Homo sapiens cONA FU1 3634 fis. done PL 3.1 

45 106474 BE383668 Hs.42484 hypothetical protein FU 1061 8 3.2 

106484 AA351978 Hs.4943 hepatocellular cardnoma associated prot 7.8 

106533 AL134708 Hs.145998 ESTs 3 

106614 AA648459 Hs.335951 hypothetical protein AF301 222 3.8 

106636 AW958037 Hs.286 ribosomal protein L4 3.3 

50 105661 AW499914 Hs7579 hypothetical protein FU 10402 3 

105743 BE613328 Hs.21938 hypothetical protein FUl 2492 4.2 

106844 AA485055 Hs.158213 spemi assodated antigen 6 3.4 
105864 AI311928 gb:qo89h04.x1 NCLCGAP_Kid5 Homo sapiens 4.4 

106865 AW192535 Hs.19479 ESTs 3.6 

55 106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1 

106942 AA995351 Hs.31314 retinoblastoma-binding protein 7 3.6 

106968 AF216751 Hs.26813 CDA14 5.3- 

107105 AW963419 Hs.155223 stanniocaldn 2 3.4. 

107158 N32849 Hs.31844 hypothetical protein FU1 2586 3.1 

60 107248 AW263124 Hs.315111 nudear receptor co-repressor/HDAC3comp 5.9 

107265 BE379594 Hs.49136 ESTs, Moderately similarfo ALLf7„HUMAN A 3.9 

107630 AW961576 Hs.60178 ESTs 4.6 

107710 AI955040 Hs^65398 ESTs. Weakly simflar to transfomtalion-r 3 

107890 AA025386 Hs.61311 ESTs. Weakly similar to S10590 cysteine 3.1 

65 107985 T40064 HsJ1968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8 

108000 AI263307 Hs.239884 H2B histone (amily. member L 32 

108217 AA056666 H&62568 ESTs 3.8 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



108435 

108591 

108733 

108771 

108819 

108912 

109086 

109124 

109132 

109163 

109277 

109410 

109454 

109514 

109581 

109632 

109644 

109700 

109768 

109807 

109842 

109895 

110024 

110561 

110675 

110707 

110915 

111139 

111155 

111199 

111336 

111510 

111532 

111689 

111823 

111876 

111892 

111893 

112125 

112170 

112287 

112300 

112303 

112478 

112561 

112631 

112637 

112657 

112678 

112917 

113070 

113095 

113117 

113187 

113200 

113206 

113374 

113440 

113494 

113518 

113571 

113822 

113835 

113938 

113947 

113970 

114086 



T82427 

AB033073 

AA121022 

AF0682g0 

AA011449 

AA136674 

AF186114 

AK000684 

AI970536 

N23235 

AA196443 

AW504732 

AA232255 

AA234087 

R45584 

AA325138 

AW973964 

FO9609 

F06838 

R43646 

AW818436 

AK001680 

AW973152 

AA379597 

H89355 

AI239832 



N64683 

N66563 

AI767435 

AI457338 

R07856 

R08440 

AA602004 

R35252 

R38239 

AA421081 

AF070526 

AW379029 

BE246743 

AB033064 

H24334 

R54797 

R66067 

AI791493 

R82040 

R82331 

AW844878 

AI418466 

AA082465 

AB032977 

AA828380 

AW813731 

BE613410 

T57773 

BE262470 

T79925 

U54727 

T91451 

AW367788 

AI702609 

NM^004585 

A1912410 

W81598 

W84768 

W27249 

AA378776 



Hs.194101 
Hs.43857 

Hs.79741 

Hs.271627 

Hs,1 18681 

Hs.270737 

Hs.183887 

Hs.16603 . 

Hs.30567 

Hs.86043 

Hs.21275 

Hs.295232 

Hs.262346 

Hs.23025 

Hs.235873 

Hs.291531 

Hs.14763 

Hs.12422 

Hs.23590 

Hs.30488 

Hs,31050 

Hs.5199 

Hs.249159 

Hs.15617 

Hs.29724 

Hs.290943 

Hs.191358 

Hs^9822 

Hs.29894 

Hs.16355 

Hs.23260 

Hs.24944 

Hs.293246 

Hs.12388 

Hs.13429 

Hs.1 18338 

Hs.288529 

Hs.334806 

Hs.26125 

Hs.28664 
Hs.1 29873 

Hs.164599 
Hs.19769 
Hs.33665 
Hs.125031 



Hs.126733 

Hs.159153 

Hs.31575 

Hs.10263 

Hs.241471 

Hs.269165 

Hs.191445 

Hs.86538 

Hs.323954 

Hs.15713 

Hs.17466 

Hs.27475 



Hs.8109- 
.Hs^88649 



Homo sapiens cONA: FU20869 fis. done A 3 

similar to glucosamine-6-sulfatases Z2 

gb:zn84f10.r1 Stratagene lung carcinoma 3.9 

hypotheticai protein FU101 16 6.1 

ESTs 3.6 

EST 3.9 

tumor necrosis factor (ligand) superfami 3.7 

hypolhelicai protein FLJ22104 3.1 

hypothetical protein FLJ13163 3.7 

ESTs, Weakly similar to B34087 hypotiieti 4.5 

Homo sapiens cOfsIA FU 1 3558 fis, done PL 3.7 

hypothetical protein FU 1 1 01 1 4.6 

ESTs. Moderately similar to A46010 X-iin 6.4 

ESTs. Weakly similar to S72482 hypotheti 4.8 

ESTs. Weakly similar to ALU5_HUMAN ALU S 3.3 . 

hypothetical protein FLJ22672 3 

ESTs, Highly similarto 1203217A dehydro 3 

gb:HSC33H092 nonnalized infant brain cDN 3.2 

ESTs 3.2 

ESTs 3.8 

solute earner family 16 (monocarboxylic 3.3 

DKFZP434F091 protein 3.6 

ESTs 4.2 

HSPC150 protein similar to ubiqultin-con 5.1 

adrenergic. alpha-2A-, receptor 5.3 

ESTs, Weakly simiiar to ALU4^HUMAN ALU S 3.7 

hypothetical protein FU13187 37 

ESTs 4 

ESTs 3.1 

ESTs 4.5 

ESTs 5.4 

ESTs 3.2 

gb:yf 19f09.s1 Soaies fetal liver spleen 3.1 

ESTs 3.2 

ESTs. Weakly similar to 2109260A B cell 3.3 

ESTs, Weakly similar to putative p150 [H 3.1 

ESTs 3.4 

Homo sapiens clone 24787 mRNA sequence 3.3 

ESTs, Weakly similar to unnamed protein 4.4 

hypothetical protein FLJ22635 7.3 

KIAA1238 protein 3.2 

ESTs 4.4 

gb:yg87b07.s1 Soares in^t brain 1 NIB H 3.4 

ESTs 8.2 

ESTs. Weakly similar to A36i036 cytochrom 5.5 

gb:yj06b06.s1 Soares placenta Nb2HP Homo 3.9 

ESTs 5.4 

hypotherical protein M6C4174 3.2 

ESTs 47 

choline/ethanolaminephosphotransferase 3.7 

KIAA1 151 protein 3.1 

ESTs 3.4 

ESTs, Moderately similar to S65657 alpha 3.4 

SEC63, endoplasmic reticulum translocon 3.2 

ESTs 3.5 

RNB6 6.2 

ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7 

ESTs 3 

ESTs 3.4 

postmeiotic segregation increased 2-like 3.1 

hypothetical protein MGC2776 3.1 

retinoic acid receptor responder (tazaro 3.9 

Homo sapiens cDNA FU1 2749 fis, done NT 3 

gb:zd88g02.s1 SoaresJetaLheaiLNbHH19W 4.5 

gb:zh53d03.s1 SoaresJetalJver_spleen^ 3.1 

hypothetical protein FLJ21080 ' 6.9 

hypothetical protein MGC3077 4.3 
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114148 AW470411 Hs.288433 neuratrimtn 4.1 

114424 AW780192 Hs.267596 ESTs 3.4 

114518 AW163267 Hs. 105469 suppressor of varl {S.cerevisiae)3-nke 3.1 

114563 AI979168 Hs.82226 glycoprotein {transmembrane) nmb 4.8 

5 114965 AI733881 Hs.72472 BMP-RIB 10.1 

114995 AA769266 Hs.193657 ESTs 3.6 

115121 AI634549 Hs.88155 ESTs 3.2 

115134 AW968073 Hs.194331 ESTs, Highly similar to A5571 3 inositol 4.2 

115167 AA749209 Hs.43728 hypothetical protein 3 

10 115253 BE149845 Hs.289038 hypothetical protein MGC4 126 3,6 

115277 AAB14100 Hs.86693 ESTs. 3.9 

115327 N46436 Hs.109221 ESTs 3.4 

115354 AA281636 Hs.334827 ESTs 4.8 

115657 AA405620 Hs.55158 ESTs. Weakly similar to T29520 hypotheU 3.5 

15 115676 AA953006 Hs,88143 ESTs 9.3 

115709 AW293849 Hs.58279 ESTs, WeaWy similar to ALU7_HUMAN ALU S 3.4 

115729 AA417812 Hs.38775 ESTs 4 

115787 AI126772 Hs.40479 ESTs 3.1 

115830 AW970529 Hs.86434 hypothetical protein FLJ21816 3.6 

20 115835 AA521410 Hs.41371 ESTs 3.1 

115850 NIVL014937 Hs.52463 KIAA0966 protein 3 

115900 AK001500 Hs.165186 hypothetical protein FU 13852 3.2 

1 15935 AA354549 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3 

115948 AL042465 Hs.43445 poIy(A)-specific ribonuclease (deadenyla 3.1 

25 116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 6.7 

116115 AL042355 Hs.70202 WD repeat domain 10 3.6 

116184 AW450737 Hs.128791 CGl-09 protein 3,1 

1 16192 AA464976 Hs.62528 ESTs. Moderately similar to A46010 X-lin 3.3 

116208 AI219083 Hs.42532 ESTs, Moderately similar to ALU8_HUMAN A 3.2 

30 116246 AF265555 Hs.250646 bacubviral lAP repeat-containing 6 3.6 

116443 AW962196 Hs.321264 LBP protein 32 4.1 

116470 AI272141 Hs.83484 SRY (sex detemriining region Y)-box 4 4.1 

116726 AKOOl 114 Hs.53913 hypothetical protein FU 10252 .8.6 

116845 AA649530 gb:ns44f05.s1 NCLCGAP_Alv1 Homo sapiens 3.2 

35 117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75_HUMAN Z 3.5 • 

117216 AI569804 Hs.42792 ESTs, Weakly similar to 178885 serine/th 3.1 

117296 AL133427 . Hs.42506 Homo sapiens mRNA full length insert cDN 3.2 

117403 H84455 Hs.40639 ESTs 4,7 

117691 AB040959 Hs.93836 DKFZP434N014 protein 3 

40 118229 AW968941 Hs.166254 hypothetical protein DKFZp568l133 3.3 

118363 AI183838 Hs.48938 hypothetical protein FU21 802 4.3 

118416 N66028 Hs.49105 FKBP-^ssodated protein 3.1 

118470 AW970584 Hs.291033 ESTs 3.4 

118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 5.2 

45 118695 AK000465 Hs.500B1 K1AA1 199 protein 3.4 

118925 N92293 Hs.206832 ESTs. Moderately similar to ALUS^HUMAN A 3,3 

119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f .19.7 

119036 R95872 Hs.1 17572 chemokine binding protein 2 3.7 

119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1 .HUMAN A 4.1 

50 119075 M10905 Hs.287820 fibronectin 1 3.2 

119620 W47620 Hs.56009 2'-6'-oligoadenyiate synthetase 3 (100 k 3.3 

119741 AF041853 Hs.43670 kinesin family member 3A 3.1 

119747 AI970797 Hs.64859 ESTs 5 

119754 AL037824 Hs,194695 ras homolog gene family, member 1 3.8 

55 119905 AW449064 Hs.1 19571 collagen, type III. alpha 1 (Ehlers-Danl 3.1 

120084 W94472 Hs.59529 ESTs. Moderately similar to ALU1_HUMAN A 8.4 

120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6 

120326 AA196300 Hs.21145 hypothetical protein RG083M05.2 3.2 

120742 AA225084 gb:nc21d06.r1 NCLCGAP.PrI Homo sapiens 3.6 

60 120870 AA357172 Hs.292581 ESTs, Moderately similar to ALUI.HUMAN A 5.8 

120885 AA365515 Hs.301872 hypothetical protein MGC4840 3 

120970 AA398118 Hs.97579 ESTs, Weakly similarlo A46010 X-linked 3.7 

121054 AW976570 Hs.97387 ESTs 5.3 

121095 AA320134 Hs.196029 Homo sapiens mRNA far KIAA1657 protein. 4 

65 121103 AA398936 Hs.97697 EST 3.5 

121121 AA399371 Hs.1 89095 similar to SAU.1 (sal (DrpsophilaHike 6.3 

121337 AW885727 Hs.301570 ESTs 4.7 



267 



wo 02/059377 PCT/US02/02242 



121351 AW206227 Hs^87727 hypothetical protein aj23132 5 

121611 M31669 Hs.1735 inhit^, beta B (actrvin AB beta polypep 3.6 

121643 AA640987 Hs.193767 ESTs 5.6 

121770 NM_.015g02 Hs.278428 progestin induced protein 3.4 

S 122125 AK000492 Hs.98806 hypothetical proten 4.1 

122338 AA443311 Hs.98998 ESTs -3 

122417 AA446965 Hs.112092 ESTs 4.7 

122513 AI767879 Hs.99214 ESTs 3,8 

122544 AW973253 Hs,292689 ESTs 3 

10 122655 AA323296 Hs.97837 Homo sapiens mRNA; cONA OKFZp547J047 (fr 5.6 

122805 AA528911 Hs.82772 collagen, type XI, alpha 1 3.2 

122851 AW205931 Hs.99598 hypotheUcal protein MGC5338 8.6 

123105 AA487809 Hs.166011 catenin (cadherin-assodaled protein), d 3 

123111 AA228776 Hs.191721 ESTs 6.9 

■15 123249 AA371307 Hs.125056 ESTs . 3.6 

123273 AA491253 Hs.173611 Empirically selected from AFFX single pr 7 

123385 BE1496B5 Hs.17767 KIAAI 554 protein 3.1 

123419 T66087 Hs.112482 Homo sapiens unknown mRNA sequence 3.4 

123485 A1308876 Hs.103849 hypothetical protein DKFZp761D1 12 3.1 

20 123645 AI675944 Hs. 188691 Homo sapiens cDNAFU12033fis.cJone HE 3.8 

123819 AA580082 Hs.112264 ESTs 4.7 

124012 AA352723 Hs.241471 RNB6 3.8. 

124243 H69125 Hs.133525 ESTs 4.1 

124357 N22401 gb:yw37g07.s1 Morton Fetal Cochlea Homo 4.1 

25 124359 N22508 Hs.139315 Homo sapiens cDl^: FU21479 fts, done C 3.6 

124567 AW451645 Hs.151504 Homo sapitenscDNAFU11973ris. done HE. . 3.1 

124911 N34151 Hs.174195 interferon induced transmembrane protein 3.5 

124972 R41396 Hs. 101 774 hypothetical protein FU23045 4.3 

125006 6E065136 . Hs.145696 splidng factor (CC1. 3) 6 

30 125042 T78906 Hs.269432 ESTs. Moderately similar to ALU1_HUMAN A 8.1 

125184 W60326 Hs.288684 Homo sapiens cDNAFU11750fis, done HE 4.7 

125243 AW970536 Hs.105413 ESTs 3.1 

125286 AF086534 Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A 3.3 

125304 AL359573 Hs.124940 GTP-binding protein 3 

35 125330 AW880562 Hs.114574 ESTs 3 

125331 AI422996 Hs.161378 ESTs 3.2 

125685 AI924630 Hs.4943 hepatocellular carcinoma associated prot 3.2 

126257 N99638 gb:za39g11.r1 Scares fetal bver spleen 4 

126474 AW975814 Hs.326714 Homo sapiens done IMAGE:713177, mRNA se 4 

40 126666' AA648886 Hs.151999 ESTs 3.8 

126872 AW450979 gb:UI-H-BI3'ala^-12-0-ULs1 NCLCGAP_Su 3 

127431 AW771958 Hs.l75437 ESTs. Moderately similar to PC4259 ferri 3.6 

127980 AA951459 Hs.125644 ESTs 4.1 

127997 AW068311 Hs.311054 Homo sapiens mRNA full length Insert cDN 3.3 

45 128420 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.6 

128609 NM_003616 Hs.102456 sun/ival of motor neuron protein interac 3.9 

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine3 . 3.1 

128955 AA775076 Hs.185807 Homo sapiens, Similar to PRO0478 protein 3.9 

129092 D56365 Hs.63525 poly(rC)-bindlng protein 2 3.3 

50 129270 AA357185 Hs.109918 ras homolog gene family, member H 3.1 

129301 AF182277 Hs.330780 cytochrome P450. subfamily IIB (phenobar 3.9 

129385 AA172106 Hs.110950 Rag C protein 6.2 

129619 AA20g534 H5.284243 tetraspan NET-6 protein 3.4 

129629 AKD00398 Hs.11747 hypotfteScal protein FU20391 3 

55 . 129725 X56411 Hs.1219 alcohol dehydrogenase 4 (dass II). pi p 3.2 

130069 AI754813 Hs.146428 collagen, type V, alpha 1 5.4 

130092 X03363 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 4.4 

130298 AI347487 Hs.132781 class I cytokine receptor 4.6 

130382 NM^003450 Hs.155204 zinc finger protein 174 5.6 

60 130622 A1582291 Hs.16846 ESTs, WeaSdy simaarto04HUD1 debrisoqu 3 

130703 R77776 Hs.18103 ESTs 3.8 

130881 AA809875 Hs.25933 ESTs 4.2 

130954 AB014544 Hs.21572 KIAA0644 gene produd 4.7 

131095 AI399653 Hs.22917 ESTs 4.3 

65 131153 H09048 Hs.23606 ESTs 3.8 

131253 R71802 . Hs.24853 ESTs 3.5 

131372 AW293399 Hs.l44904 nudear receptor co-repressor 1 3.6 
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131507 A1826268 Hs.27769 ESTs, Weakly similar to MCAT.HUMAN MITOC 3.2 

131587 AI695549 Hs.183868 glucuronidase, beta 3.1 

131739 AF017986 Hs,31386 secreted frizzled-related protein 2 3.2 

131795 BE501849 Hs.32317 high-mobility group 20B 3.2 

5 131970 D86960 Hs.3610 KIAA0205 gene product 3.6 

131986 NM_002314 Hs.36566 LIM domain kinase 1 3.2 

132093 AA400091 Hs.39421 ESTs 3.2 

132122 AA426202 Hs.40403 Cbp/p300-lnteracting transaclivator, wit 3 

132159 D76435 Hs.41154 Zic family member 1 (odd-paired Dfosophi 3.2 

10 132333 AA192669 Hs.45032 ESTs 3.5 

132406 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 9.2 

132482 AV660345 Hs.238126 CGM9 protein 8.2 

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 3.4 

132624 AA326108 Hs.33829 bHIii protein DEC2 3.2 

15 132700 AA319233 . Hs.5521 ESTs 4.8 

132725 NM,006276 Hs.184167 spfidng factor, arginineteerine^ch 7 3.6 

132799 W73311 Hs.169407 SAC2 (suppressor of actin mutations 2. y 3.2 

132847 T48195 .Hs.58189 eukaryotic translation initiatton factor 3.5 

132857 Y00272 Hs.184572 cell division cycle 2. G1 to S and G2 to 4.4 

20 132936 AL120659 Hs.6111 aryl-hydrocarbon receptornudeartransl 4.8 

133130 AI128606 Hs.6557 zinc finger protein 161 3.3 

133142 AW952412 Hs.65874 ESTs, Weakly similar to A40348 Elav/Sex- 3.5 

133167 AW162840 Hs.6641 kinesin family member 50 4.5 

133225 AW600291 Hs.6823 hypothetical protein FLJ1 0430 3.3 

25 133274 AA085191 Hs.6949 hypothetical protein MGC1 1275 3 

133275 Z93241 Hs.239934 CGI-96 protein 4.5 

133287 AW797437 Hs.69771 B-f actor, properdin 4.1 

133376 BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1 

133462 AW675064 Hs.73875 fumarylacetoacetete hydrolase (fumaiy lac 3 

30 133740 AW162919 Hs.170160 RAB2. member RAS oncogene family-like 3.4 

133831 BE274552 Hs,76578 protein inhibitor of activated STAT3 3.9 

133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2 

134665 BE391929 Hs,8752 transmembrane protein 4 3.1 

134710 A1433797 Hs.8889 serine hydroxymethyltransferase 1 (solub 3 

35 134731 D89377 Hs.89404 msh (Drosophila) homeo box homoiog 2 5.8 

134776 J05582 Hs.89603 mucin 1, transmembrane 4 

135230 AF064804 Hs.96757 suppressor of Ty(S.cerevisiae) 3 homdo 3.2 

135303 .R61253 Hs.98265 KIAA1 877 protein 3.3 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 4.8 

40 135411 L10333 Hs.9g947 reticutoni 3.8 

300089 AI199738 . Hs.208275 ESTs, Weakly similar to ALUA^HUMAN I!!! 3.8 

300233 AW614220 Hs.189402 ESTs 4.2 

300254 AW183618 Hs.55610 solutecarrierfamily 30 (zinc transport 9.9 

300256 AW591433 Hs.298241 Transmembrane protease, serine 3 4.9 

45 300378 Z45270 Hs.235873 hypotheftcai protein FU22672 3.4 

300973 AA572949 Hs,207566 ESTs 3.5 

301111 R10799 Hs.191990 ESTs 3.8 

301341 AA887801 Hs.208229 G protein-coupled receptor 13.9 
301548 AI091631 Hs.203845 two pore potassium channel KT3.3 4.4 

50 301884 AA312082 Hs.105445 GDNF family receptor alpha 1 5.7 
301936 U79745 Hs.1 14924 solute canler family 16 (rhonocarboxylic . 8.6 
301976 T97905 gb:ye54c10.r1 Scares fetalliver spleen 3.9 

302001 AB020711 Hs.278346 KIAA0904 protein 7.7 
302067 BE542706 Hs.222399 CEGP1 protein 7.3 

55 302094 AW749321 Hs.6786 ESTs 3.3 
302099 AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 4.2 
302145 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 7.9 
302235 AL049987 Hs.166361 Homo sapiens mRN A; cDNADKFZp564F11 2 (fr 5.6 
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2. 5.4 . 

60 302290 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 
302372 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 6.7 
302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cON 4 

302384 AI678059 Hs.202676 synaptonemal complex protein 2 4.3 

302385 AJ224172 Hs.204096 ripophilin 8 (uteroglobin family member) 133 
65 302680 AW192334 Hs.38218 . ESTs 9.6 

302830 A1038997 Hs.132921 ESTs 5 
302857 AF282265 Hs.44836 innercentiomere protein antigens (135kD 3.4 
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302892 AW176909 Hs.42346 caldneurin-binding protein calsarcin-1 3.4 

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dynein he 5.1 

303271 AA652687 Hs.96151 Human DNA sequence from done RP5-1103G7 3.7 

303289 AL121460 Hs.272673 hypothetical protein FU20508 4.1 

5 303357 AW006352 Hs.159643 ESTs, Wealdy similar to T32554 hypotheti 4.2 

303540 AA355607 Hs,309490 ESTs. Weakly similar to putative WHSC1 p 4.3 

303563 AA367699 Hs.10082 potassium intemaediate/smaH conductance 3.3 

303642 AW299459 gb:xs50d08 j(1 NCLCGAP.Kid1 1 Homo sapien 4.2 

303780 AI424014 Hs.18995 KIAA1304 protein 3.6 

10 303797 AW629759 gb:hh70e05;y1 NCLCGAP^GUI Homo sapiens 4.9 

303852 R53434 Hs.90207 hypothetical protein MGC1 1138 3.7 

304328 AA1 49951 Hs.62112 zinc finger protein 207 3 

304782 AA582081 gb:nn32h0B.s1 NCI_CGAP_Gas1 Homo sapiens 4.1 

305913 AA876109 gb:nx24h01 .si NCI_CGAP_GC3 Homo sapiens 3 

15 305917 AA876469 gb:oe48b04,s1 NCI_CGAP_Pr25 Homo sapiens 3.1 

307010 AI140014 gb:qa68fD9.x1 SoaresJetaLheart_NbHH19W . 3.5 

307041 AI144243 gb:qb85b12j(1 Scares Jetal_heait_NbHH19W 3.9 

308106 AI476803 gb:tj77e12.x1 Soares_NSF_F8 9W OT PA_P_S 4.3 

308307 AI581398 Hs.172928 collagen, type 1. alpha 1 4.6 

20 308615 AK000142 Hs.101774 hypothetical protein FU23045 4.4 

309177 AI951118 Hs.326736 Homo sapiens breast cancer ant^enNY-BR 17.3 

309328 AW024348 Hs.233191 EST. Weakly similar to A27217 glucose tr 3.2 

309574 AW168083 gbDcg59g04.x1 NCLCGAP^Ut4 Homo sapiens 3.1 

309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-8R 57.6 

25 310064 AI199712 Hs.148486 ESTs. Weakly similar to 1917210APro/Afg 4.6 

310098 AI685841 Hs.161354 ESTs 3.6 

310438 AW022192 Hs.200197 ESTs 4.6 

310683 AI939456 Hs. 160870 ESTs 3.2 

310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1 551 protein, 3.6 

30 310781 AI380797 Hs. 158992 ESTs 10.2 

310895 AI955121 Hs.165724 N-acetylgalactosamine-4-O-suUotransfera 3.4 

310955 AI476732 Hs.2639l2 ESTs 10.9 

311117 AI671439 Hs.196029 Homo sapiens mRNA for KIAA1657 protein. 3.1 

311166 AI821005 Hs.118599 ESTs 10.8 

35 311237 AA641098 Hs.208809 ESTs. Moderately similar to ALU1_HUMAN A 4.3 

311465 AJ758660 Hs.206132 ESTs 4,4 

311587 A1828254 Hs.271019 ESTs, Weakly similar to A47582B-cengr 5.1 

311598 AW023595 Hs.232048 ESTs 5.8 

311774 AA700870 Hs. 14304 ESTs 3.3 

40 311785 AI056769 Hs.133512 ESTs 3.9 

311872 R12375 Hs.194600 ESTs 3.3 

311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF.HUMAN PTB-AS 3 

311913 AI358522 Hs,270188 ESTs 3 

311923 T60843 Hs.189679 ESTs 5.6 

45 311935 AA216387 gb:nc16b02.s1 NCLCGAPJ'rl Homo sapiens 5.2 

312019 AA373630 Hs.188760 ESTs 3 

312021 AA759263 Hs.14041 ESTs 3.4 

312067 T78968 Hs.14411 ESTs 3.5 

312090 T80177 Hs.118064 similar to rat nudear ubiquitous casein 3.8 

50 312147 AI633744 Hs.195648 ESTs. Weakly similar to 138022 hypofheli 4.4 

312153 BE261944 Hs.118625 hexokinasel 5.2 

312168 T92251 Hs.198882 ESTs 3.3 

• 312182 T94344 Hs.326263 ESTs 3.3 

312187 AA700439 Hs. 188490 ESTs 3.4 

55 312199 AW438602 Hs.191179 ESTs 3.9 

312219 H73505 Hs.1 17874 ESTs 4 

312226 AA315703 Hs.199993 ESTs. Weakly similar to ALUB_HUMAN l!!l 4.9 

■ 312299 AA972712 Hs.269737 ESTs 5.7 

312544 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 6.3 

60 312638 AW439195 Hs.256880 ESTs. Weakly similar to 865557 alpha-IC- 4.9 

312826 AW291545 Hs.185018 ESTs 4.9 

312837 AW292286 Hs.255058 ESTs 44 

312980 AA497043 Hs.1 15685 ESTs 3.1 

313070 AM22023 Hs.161338 ESTs 4.3 

65 313079 N76497 Hs.1787 proteolipid protein 1 (PeFizaeus-Menzbac 3.3 

313089 AF026944 Hs.293797 ESTs 5.8 

313096 AW073310 Hs.163533 Homo sapiens cONA FU14142 fs, done MA 4.5 
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313126 AA746503 Hs.283313 ESTs 10 

313166 AI801098 Hs.151500 ESTs 3.5 

313197 AW979008 Hs.222487 ESTs 3.3 

313280 AW960454 Hs.222830 ESTs 47 

5 313325 A(420611 Hs.127832 ESTs 3.4 

313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4 

313352 AW1 50945 Hs.1 44758 ESTs 4,1 

313385 AI032087 Hs.269819 ESTs 3 

313393 AI674685 Hs.20014l ESTs. 5.2 

10 313417 AA741151 Hs.137323 ESTs 3.5 

313434 W92070 gb:zh48g05.r1 Scares JetaLrwer.spteen. 37 

313569 AI273419 Hs.135146 hypothetical protein FU13984 3 

313591 AA046309 gb:zf12fD1.s1 SoaresJetaLhearLNbHHISW 5.6 

313615 A1540978 Hs.301997 hypothetica) protein RJ13033 3.2 

15 313915 C18863 Hs.163443 Homo sapiens cDNA FLJ1 1576 fis. done HE 26.3 

313975 AW175896 Hs.65114 keratin 18 3 

313979 AI535895 Hs.221024 ESTs 4.9 

313997 AV657317 Hs.28864g hypothetical protein MGC3077 3.9 

314043 AA627082 Hs.291872 ESTs 3.1 

20 314078 AW129357 Hs.329700 ESTs 8.3 

314097 AA648744 Hs.269493 ESTs 6.6 

314121 AI7320a3 Hs.187619 ESTs 6.2 

314129 AA228366 Hs.l15122 ESTs 4 
314138 AA740616 gb:ny97f11.s1 NCLCGAP^GCBI Homo sapiens 5.9 

25 - 314236 AA743398 Hs.189023 ESTs 3.1 

314244 AL035450 Hs.103238 ESTs 4 

314305 AI280112 Hs.125232 Homo sapiens cDNAFU 13266 fis. done OV 8 

314306 AI697901 Hs.192425 ESTs 37 
314322 AA907153 Hs.190060 ESTs 3.3 

30 314394 AW961597 Hs.130816 ESTs, Moderately similar to 138022 hypot 4.2 

314401 AI660412 Hs.234557 ESTs 3.3 

314465 AA602917 Hs.156974 ESTs 47 

314506 AA833655 Hs.206868 Homo sapiens cDNAFLJI 4056 fis. done HE 8.5 

314510 A1204418 Hs.19008O ESTs 4 

35 314546 AW007211 Hs.16l31 hypothetical protein FU 12876 3.4 

314547 AA399272 Hs.144341 ESTs 67 

314558 AI873274 Hs.190721 ESTs 27.4 

314627 AA425310 Hs.155766 ESTs. Weakly simitar to A47582B-ceflgr 4.4 
314648 AW979268 gb:EST391378 MAGE resequences. MAGP Homo 4.6 

40 314691 AW207206 Hs.136319 ESTs 20.7 

314729 AA457367 Hs.191638 ESTs 3.6 

314754 AW026761 Hs.134374 ESTs 3.6 

314814 BE350122 Hs.157367 ESTs, Weakly similar to 178885 serineAh 4.9 

314864 AW971198 Hs.294068 ESTs . 4.3 

45 314881 AI095087 . Hs.1 52299 ESTs. Moderately similar to S65657 alpha 37 

314882 AA828032 Hs.189076 ESTs 3.1 

314981 AW972359 Hs.293334 ESTs 3 

315006 A1538613 Hs,298241 Transmembrane protease, serine 3 10.9 

315021 AA533447 Hs.312989 ESTs 5.3 

50 315051 AW292425 Hs.163484 ESTs 12.9 

315060 AA551104 Hs. 189048 ESTs. Moderately similar to ALUC^HUMAN ! 5.8 

315073 AW452948 Hs.257631 ESTs 4.2 

315080 AA744550 Hs.136345 ESTs 3.7 

315175 A1025842 Hs.152530 ESTs 6 

55 315183 AW135134 Hs.220277 ESTs 3.9 

315193 A1241331 . Hs.131765 ESTs. Moderately similar to 138937 DNA/R 4.4 
315195 AI367347 Hs.44898 Honw sapiens done TCCCTA00151 mRNA sequ 8.2 
315198 AI741506 Hs.165900 ESTs. Weakly similar to ALU1_HUMAN ALU S 3.6 

315240 R38772 Hs.172619 myelin transcription factor 1-like 3.4 

60 315263 AW510994 Hs.220740 ESTs 3.4 

315282 A1222165 Hs.144923 ESTs 4^ 

315296 AA876905 Hs,125286 ESTs 4 

315368 AB037745 Hs.104698 KIAA1 324 protein 4.7 

315397 AA218g40 Hs.137516 fidgetin-fike 1 3.1 

65 315489 At378817 Hs.191847 ESTs 3.1 

315498 AA628539 Hs.116252 ESTs. Moderately sImUar to ALUI.HUMAN A 3.2 

315526 AI193043 Hs.128685 ESTs. Weakly similar to T17226hypotheti 4.1 
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315530 AW0li5415 Hs,127780 ESTs 8.9 

315562 AA737415 Hs.l52826 ESTs 5.5 

315634 AA8370a5 HS.2205B5 ESTs 6.3 

315647 AA648983 Hs.212911 ESTs 3.6 

5 315707 AI418055 Hs,161160 ESTs 5.1 

315772 AW515373 Hs.271249 Homo sapiens cDNA FU13580 fis. clone PL 3.1 

315850 AW270550 Hs.1 16957 ESTs 3.8 

315858 AA737345 Hs.294041 ESTs 5 

315878 AA683336 Hs.189046 ESTs 3.1 

10 315977 AW865916 Hs.151206 ESTs 4.7 

315978 AA830893 Hs.119769 ESTs 4.1 

315995 A(217477 Hs.194591 ESTs 4.1 

316012 AA764950 Hs.119898 ESTs ' 7 

316042 AI469960 Hs.170698 ESTs 4.9 

15 316052 AI962796 Hs.136754 ESTs 4.1 

316072 AW517524 Hs.135201 N0D2 protein 3.2 

316074 AW975114 Hs.293273 ESTs 3.8 

316100 AW203986 Hs.213003 ESTs 3.2 

316133 AI187742 Hs.125562 ESTs 3.7 

20 316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1 HUMANA 30.7 

316186 AI433540 gb:ti69g05.xl NCI_CGAP_Kid11 Homosapien 3.1 

316244 AI640761 Hs.224988 ESTs 3^ 

316303 AA740994 Hs.20g609 ESTs 3.8 

316313 AA741300 Hs.202599 ESTs, Weakly similarto 138022 hypotheti 4.4 

25 316364 AA747807 Hs.149500 ESTs 3.2 

316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4 

316697 AW293174 Hs.252627 ESTs 4.4 

316715 AI440266 Hs.170673 ESTs. Weakly similar to T24832 hypothetl 3 

316868 AI660898 Hs.195602 ESTs 3.2 

30 316869 AI954880 Hs.134604 ESTs 3.2 

316886 AA836331 Hs.134981 ESTs 4.4 

316897 AA838114 Hs.221612 ESTs 3.7 

316943 AW014875 Hs,137007 ESTs 4.6 

317069 AI732892 Hs.190489 ESTs 5.9 

35 317194 AW445167 Hs.126036 ESTs 4.1 

317360 AI125252 Hs.126419 ESTs 3.5 

317404 AI806867 Hs.126594 ESTs 5.1 

317452 AA972965 Hs.1 35568 ESTs 6.9 

317501 AI822034 Hs.137097 ESTs 4.6 

40 317674 AW294909 Hs.1 32208- ESTs 4.3 

317803 AW664g64 Hs.128899 ESTs 6.1 

317834 X56348 Hs,287270 ret proto^ncogene (multiple endocrine n 3.1 

317850 AI681545 Hs.152982 hypothetical protein FU131 17 3.4 

317881 AI827248 Hs.224398 Homo sapiens cDNAFUl 1469 fis. clone HE 9.6 

45 317902 AW102941 Hs.211265 ESTs 4.1 

317916 AI565071 HS.1599B3 ESTs 10.3 

318042 AW294522 Hs.149991 ESTs 3.1 

318223 AI077540 Hs, 134090 ESTs 3.9 

318327 AW294013 Hs.200942 ESTs 3 

50 318332 AI093930 Hs.163440 Homo sapiens cDNA: FU21000 fis. done C 4.4 

318418 AF107493 Hs.1 18498 Homo sapiens LUCA-1 5 protein mRNA. spHc 5.4 

318558 AW402677 Hs.146381 RNA binding mofif protein. X chromosome 4.4 

318625 AA526235 Hs.193162 Homo sapiens cDNAFLJ1 1983 fis, done HE 5.9 

318634 T49598 Hs.1 56832 ESTs 4 

55 318740 NM_002543 Hs.77729 oxidised low density tipoprotein (lectin 7.3 

318744 AI793124 Hs.144479 ESTs 17.8 

318781 F11802 Hs.6818 ESTs 3 

319191 NM_012391 Hs.79414 prostate epHhelium-spedfic Ets transcr 3.6 

319478 A1524124 Hs.270307 ESTs 4.6 

60 319510 W88532 Hs.254562 ESTs 3.3 
319551 AA761668 gb:nz24c08.s1 NCLCGAP^GCBI Homo sapiens 3.2 

319745 T79366 Hs, 108258 actin binding protein; macrophin (microf 3.3 

319834 AA071267 gb:zm61g01.r1 Stratagene fibroblast (937 6.2 

319840 C19035 Hs.164259 ESTs 3.3 

65 319977 AA534222 gb:nj21d02.s1 NCLCGAP.AAI Homo sapiens 4.3 

320074 AA321166 Hs.278233 ESTs . 3.4 

320167 AA984373 Hs.90790 Homo saptens cDNA: FU22930 fis. dbne K 4.1 



272 



BNS00C10: <WO q2a59377»2Ll_> 



wo 02/05937.7 PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



320187 
320211 
320416 
320588 
320535 
320654 
320742 
320832 
320915 
321016 
321107 
321171 
321253 
321318 
321642 
321644 
321683 
321758 
321811 
321828 
321910 
321937 
321978 
322035 
322136 
322258 
322296 
322303 
322476 
322520 
322521 
322567 
322595 
322675 
322766 
322818 
322882 
322975 
323091 
323131 
323168 
323244 
323262 
323332 
323333 
323335 
323645 
323663 
323693 
323782 
323817 
323930 
323974 
324001 
324035 
324261 
324285 
324296 
324305 
324432 
324585 
324598 
324603 
324631 
324716 
324748 
324771 



T99949 
AL039402 
AI026984 
U78082 
N50617 . 
AI160015 
AI601188 
AA214584 
AI359144 
BE144167 
AI732643 
AI769410 
AA610649 
AB033041 
A1432ig9 
AW975944 
A1471598 
U29112 
D80630 
R59890 
H67065 
AL049351 
N77342 
AL137517 
AF075083 
BE265745 
W76326 
A1357412 
AW963372 
T55958 
AF147347 
AF155108 
W92147 
AA017656 
AW068865 
AW043782 
AW248508 
C16391 
A1902456 
AK002088 
AL120862 
AW675572 
AL133990 
A182g520 
AV651680 
AI655499 
AW445014 
BE081058 
AA317962 
AW951560 
AA410943 
AL043683 
AI825204 
AL044949 
AI472078 
BE069341 
AA431159 
A1524039 
AA642007 
AA4M510 
A1823959 
AW972227 
AW993522 
AA937116 
BE169746 
AW974941 
AA631739 



Hs.303428 Homo sapiens cONA FU14832 fis. done OV 5.3 

Hs.125783 DEME-6 protein 9.2 

Hs.293662 ESTs 3.1 

Hs.167738 RNApoJymefaseJltfanscriplionajreguIa 3.1 

Hs.80506 small nuclear nl)6nuCl8oprotein polypept 6.1 

Hs,118112 ESTs 3.5 

Hs, 120910 ESTs 3 

Hs.290167 ESTs 3.7 

Hs.143688 Homo sapiens cDNA: FU23031 fis. done L 3.1 

Hs.49994 hypothetical protein similar to RNA-bind 3.3 

Hs.144151 ESTs 12.3 

Hs.221461 ESTs 3.3 

Hs.333239 ESTs 3 

Hs.137507 vang (van gogh, DrosophilaHike 2 3.9 

Hs.247084 ESTs 3 

Hs.237396 ESTs 11-7 

.Hs.197531 ESTs 3.8 

.Hs.196151 ESTs 4.4 

gb:HUM091D02B Human fetal brain (TFujiwa 3.2 

Hs.83623 nuclear receptor subfamily 1 , group I, m 3.1 

Hs.271530 ESTs. Weakly similar to ALU7.HUMAN ALU 3 4.7 

Hs.302058 Homo sapiens mRNA;cONA DKFZp566C093(fr 3.5 

Hs.21851 Honw sapiens cDNA FU12900 fis, done NT 5 

Hs.334473 hypothetical protein DKFZp5640 1278 19 

gb•i^omo sapiens full length insert cDNA 3.6 

Hs.194359 ESTs. Weakly similar to ALUC.HUMAN !!!! 3 

gb:zd60d04.r1 SoaresJetaLheart_NbHH19W 4.4 

Hs.157601 ESTs 11-5 

Hs.46677 PRO2000 protein 3 

gb:yb35(05.r1 Stratagene fetal spleen (9 3 

gb:Homo sapiens full length insert cDNA 4.2 . 

Hs.2561 50 Homo sapiens. Similar to RIKEN cDNA 2810 4 

Hs. 118394 ESTs 5.4 

gb:ze39h01 .r1 Scares retina N2b4HR Homo 3.1 
Hs.288467 Homo sapiens cDNAFU12280 fis, done MA 5.2 
Hs.293616 ESTs 7.6 
Hs.279727 Homo sapiens cDNAFLJ14035 fis, done HE 5.9 
gb:C16391 Clontech human aorta poIyA mRN 16.5 
Hs.210761 ESTs. Weakly similar to 138022 hypotheti 4 
Hs.270124 Homo sapiens cDNA FU1 1226 fis, done PL 3.3 
Hs.124165 programmed cen death 9 (PDC09) 6.3 
Hs.193620 ESTs 4.6 
Hs.190642 ESTs . 10.5 
gbwll9c06JCl NCLCGAP^Utl Homo sapiens 6.2 
Hs.208558 ESTs 4.3 
Hs.161712 ESTs 9.2 
Hs.197746 ESTs 3.1 
Hs.243023 ESTs 4 
Hs.249721 ESTs, Moderately similar to PC4259 feni 3 
Hs.97600 ESTs 3.2 
BMP-RIB 8.4 
Hs.8173 hypothetical protein FU10803 3.3 

Hs.211408 ESTs . 4.5 
Hs.1 16298 ESTs 4.5 
Hs.303662 ESTs 8 4 

gb:QV3-BT0381-270100-d73-c08 BT0381 Homo 49,4 
Hs.122954 ESTs .3 
Hs.1 92524 ESTs 3 
Hs.1 16369 ESTs 3.3 
Hs.152812 ESTs 16.5 
Hs.132678 ESTs 3.3 
Hs. 163986 Homo sapiens cDNA:FU22765 fis. done K 5 
Hs.292934 ESTs 10.4 
Hs.293683 ESTs. WeaWysimflar to 154374 gene NF2 3.3 
Hs.12504 likely ortholog of mouse Arkadia 3.2 
Hs.292385 ESTs. Weakly slmHar to 178885 serine/th 3 
H5.335440 EST 3 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



324774 
324823 
324824 
324826 
324961 
324987 
324994 
325146 
325372 
325544 
327075 
332798 
334223 
334447 
335809 
335824 
338255 
409430 
428046 
432558 
436808 
448569 
453542 



AI031771 

AW516704 

AI826999 

AA704806 

AA613792 

AI375572 

AI805416 

Ai064690 



R21945 

AW812795 

R97268 

AA731602 

BE382657 

AW836724 

M97935 

M97935 

M55150 

M13755 

AI052047 

AA252033 

AA401739 

H18459 

R48744 

M31682 

AA416873 

D80240 

R49590 



X03363 



Hs.132586 
Hs.208726 
Hs.224624 
Hs.143842 

Hs.172634 
Hs.213897 
Hs.171176 



Hs.166975 

Hs. 155381 

Hs.177269 

Hs.120266 

Hs.21466 

HS.3396G0 



AA034918 



AF049569 
M13955 



HG4126HT4396 



ESTs 
ESTs 
ESTs 

ESTs. Weakly similar to 2004399A chromos 

gb:no97h03.s1 NCLCGAP.Pr2 Homo sapiens 

ESTs 

ESTs 

ESTs 

Phase 2 & 3 Exons 
Phase 2 & 3 Exons 
Phase 2 & 3 Exons 

C22000007:gi|12314195|emblCAB99338.1|(A 
NM_005080*:HonK) sapiens X-box binding pr 
NM_012429*:Homo sapiens SEC14 (S. cerevi 
NM_014509*:Homo sapiens kralcen-like (BK1 
ENSP00000249072*:DJ222E13.1 (N-TERMiNAL 
NM_014323*:Homo sapiens zinc finger prot 
splicing factor, argmine/serine-rich 5 
ESTs, Moderately similar to 138022 hypot 
ESTs 
ESTs 

s^nal transducer and activator of trans 

Homo sapiens mRNA expressed only in plac 

AFFX control: STAT1 

AFFX control: STAT1 

fumarylaoetoacetate 

interferon stimulated protein; 15 kDa 

ESTs 

ESTs: Weakly sinular to llll ALU SUBFAMILY J 
ESTs 

hepatocellular carcinoma associated protein: 
ESTs 

inhibrn: beta B (activin AB beta polypeptide) 
ESTs 

HUM5G11A Human fetal brain (TFujiwara) Homo 
ESTs 

CH22^FGENES.678^5 

CH22.FGENES.619_7 

CH22_FGENES.619_12 

CH22^EM:AC005500.GENSCAN.127 9 

CH22_EM:AC005500.GENSCAN.304 2 

CH22_FGENES.271_8 

CH22_FGENES.619_13 

CH22_FGENES.271_7 

CH22_FGENES.617_7 

CH.07_hsgl|6004473 

CH22_FGENES.264J 

HER2 receptor tyrosine kinase (c erbB 2; ERBB2: 

CH22_FGENES.617_9 

CH.07_iisgi|5868264 

CH.19^iBgi|5867439 

CH22_FGENES,63 

CH.17,hsgiI5867230 

CH.20_hsgi|6552458 

CH22_Ef^:AC005500,GENSCAN.148 22 

CH22_FGEf\ES.659J0 

K1AA1028 protein 

CH22_FGENES.48_12 

CH22_FGENES.118_2 

ESTs 

multiple UniGene matches 
CH22^FGENES.619.8 
CH22_FGENES.13 7 

CH22_FGENES.360^3 
CH22_FGENES.706„9 
CH.21_hsgi|6531965 
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3.4 
3.1 
.4.4 
3.9 
18.8 
3.3 
4.2 
4.4 
5.7 
3.8 
4.3 
26.2 
3.9 
10.1 
20 
9 
4 

4.6 

3.2 

3.9 

4.1 

3.7 

3.2 

3 

3 

4.5 
6.7 
3.2 
3.3 
3 

4.2 
3 
3 
4 

3.2 

16.8 

12.9 

11.3 

9.2 

8.5 

8.4 

8 

7.3 
7.2 
7.1 
6.8 

neu) 6.6 

6.5 

5.8 

5.7 

5^ 

5.1 

5.1 

4.7 

4.6 

4.6 

4.5 

.4.5 
.4.4 

4.3 

4.3 

4.3 

Zinc Finger Protein Hzf4 

.4.3 

4.3 

4.2 
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HG2614HT2710 
5 X83535 



10 



15 



20 



25 



30 



35 



40 



45 



HG4716HT5158 



HG4677HT5102 



AA714311 



HG2465HT4871 



AA707750 



AA976074 



CH.17^hsgil5857215 


4.1 


CH22_FGENES.6693 


4.1 




Collagen, Type Vfii, Alpha 1 4,1 


CH22_FGENES.48„18 


4.1 


matrix metaltoprDtelnase 14 (membrane inserted) 


4 


CH22^FGENES.271_6 


3.9 


CH2^FGENES.617_3 


3.9 


CH22_FGENES.290_8 


3.8 




Guanosine 5' Monophosphate Synthase 


CH22_FGENES.13 5 


3,8 


CH22^FGENES.13 2 


3.8 


CH.14_hsgi|6682474 


3.8 


CH.02_hsgiI5867750 


3.8 


CH22^FGENES.617JB 


3.7 




Oncogene Ret/Ptc2, Fusion Activated 


CH22LDJ32I10.GENSCAN.23 39 


3.7 


CH22.F6ENES.543.20 


3.7 


CH22«EM:AC005500.GENSCAN.96 1 


3.7 


CH22,FGENES.204_2 


3.5 


CH2a.FGENES,619_4 


3.5 


CH.16^hsgl|5867087 


3.5 


EST duster (not in UniGene) 


3.4 


CH22.EM:AC005500.GENSCAN.149 9 


3.4 


CH22 EM:AC005500.GENSCAN,421 5 


3.4 


CH22^FGENES.134 


3.3 


CH.07 hs gi|6004478 


3.3 


CH22^FGENES.360J 


3.3 




Dna Binding Protein Ap 2, Alt Splice 3 


CH22 FGENES.6 2 


3.3 


CH22.C20H12.GENSCAN.16 2 


3.2 


CH22_C65E1.GENSCAN.8 1 


3.2 


ESTs; Weakly similar to cis Golgi matrix 


3.1 


CH22 FGENES.307.4 


3.1 


CH22 EMAC005500.GENSCAN.24814 


3.1 


CH.06.hsgi|5902482 


3.1 


CH22_FGENES.669_5 


3.1 


CH22„OJ32I10.GENSCAN,19 8 


3.1 


CH22.FGENES.527_6 


3.1 


CH22_FGENES.330J0 


3.1 


CH22.FGENES.142 


3.1 


ESTs 


3 


CH22LFGENES.2267 


3 


CH2^„FGENES.133 


3 


CH22^EM:AC005500.GENSCAN.209 12 


3 


CH2^FGENES.271.3 


3 



3.8 



3.7 



3.3 
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TABLE 17A 

Table 17A shows the accession numbers for those pkeys lacking unigenelD's for Table 17. 
For each probeset, we have listed the gene cluster niunber from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" colimin. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene dusternumber 
Accesston: Genbank accession numbers 



Pkey CAT number Accession 



116845 393481J AA649530 AA659316 H64973 
103207 30635_-4 X72790' 

126257 182217_1 N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815 

102791 37186_1 AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 A1636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574 
N25695 AW665465 AI818326 AA126128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 A1080480 A1631703 AI651Q23 A1867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 
A1652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N56048 
AA703396 H92278 AW139734 H92683 U87589 U87595 HSgOOl U87594 BE466420 AI624817 BE46661 1 A1206344 AA574397 
AA348354 AI493192 

126872 142696.1 AW450979 AA1 36653 AA1 36656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
1746257^1 R82040 R70934 
176835J AA225a84 AA302713 

324239_1 AI31 1928 AA936030 T51931 AA609816 AA487195 AA664207 
genbanK.F09609 F09609 
genbank_R08440 R08440 



112631 
120742 
105864 
109700 
111532 
113938 
113947 
124357 
108733 
112303 
322136 
322296 
321811 
314648 
322520 
322521 
322675 
323332 
316186 
322975 
324261 
323817 
301976 
324981 
303642 
303797 
319551 
311935 
319834 
319977 
314138 
313591 



genbank.W81598 W81598 

genbank_W84768 W84768 

genbank_N22401 N22401 

504187_1 AA121022AA126422 

genbanleR54797 R54797 

46802_1 AF075083 H52291 H52528 

47334_1 W76326 AF086341 W72300 

1527481.1 D80630 D80896 D80895 

293660_1 AW979268 AA878419 AA431342 AA431628 

38916_1 T55958 T57205 AF147346 

38917.1 AF147347 T55426 T55503 . 

86787.1 AA017656AA017374AA019761 

179142.1 AI829520 A1791832 AA228414 AI791823 AA229211 AA229315 
425440 J AI433540 AA728984 AA804981 
1510563.1 C16391 016413 

273265 J BE069341 AW74&403 AL044891 AI908240 AA393080 

233566 J AA410943 AW948953 AA334202 AA332882 

128835.1 T97905AA101672 

376239J AA613792 AW182329 T05304 AW858385 

284260.1 AW299459 AA4171 12 

386364.1 AW629759 AW749955 AA633408 AI651005 

357371 J AA761668 AA573621 R92814 R09670 

174129.1 AA216387 T63548 AA228676 

112523.1 AA071267 T65940 T64515 AA071334 

345248.1 AA534222 AA632632 T81 234 

179950.1 AA740616 AA654854 AA229923 

103087 J AA046309 AI263500 AA046397 
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308106 AI476803 

338255 CH22 6856FG_LINK^EM'^COO 
335809 CH22 3181FG.617_6_LINK^EM 
335824 CH22_3197FG.619.11_UNK.E 
5 307010 AI140014 
307041 AI144243 
305913 AA876109 
305917 AA876469 
309574 AW168083 
10 325372 c12_hs 
325544 c12 hs 

332798 CH22J4FG_6^5„UNK^C4G1.G 
334223 CH22L1507FG^360.4^UNK_EM 
'327075 c21_hs 
15 334447 CH22^1746FG.387.7_UNK-EM 
304782 AA582081 

313434 441798_1 W92070 AW019952 W92053 
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TABLE 17B 

Table 17B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 17, For each predicted exon, we have Usted the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



Pkey: Unique number corresponding to an Eos prabeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Qt) numbers. "Dunham I. et al.' refers to the pubHcation 

entitled The DMA sequence of human chromosome 22." Dunham \. et al., Nature (1999) 402:489-495. 
strand: Indicates DNA strand from which exons were predicted. 

NLposilton: Indicates nudeofide positions of predicted exons. 



Pkey Ref 



334447 
335809 
335824 
332798 
334223 
338255 
325372 
325544 
327075 



Dunham, I. elal. 
Dunham. I. etal. 
Dunham, I. etal. 
Dunham, 1. et.al. 
Dunham, I. eta). 
Dunham, t. et.al. 
5866920 
6682452 
6531965 



strand 

Plus 

Plus 

Plus 

Minus 

Minus 

Minus 

Minus 

Plus 

Plus 



Hi j30Sition 

14308764-14308824 

26310772-26310909 

26376860-26376942 

232147-231974 

12734365-12734269 

15242294-15242231 

1117061-1117304 

171228-171286 

4041318-4041431 
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TABLE 18: Table 2 from BRCA 014 P 

Table 18 shows genes with atleast five times the expression in breast tumor tissue than is 
expressed in normal body tissues. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank aixession number 

10 UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Ratio of tumor to nonnat body tissue 



15 Pkey ExAccn UnigenelD Unigene Title 



R1 



Hs. 180884 
Hs.183650 
Hs.161640 



Hs.129873 
Hs.16459g 



HS72472 
Hs.206832 



101378 BE563085 Hs,833 
101530 M29874 Hs.1360 
101767 M81057 

20 101878 M97815 
103010 X52509 
104115 AF183810 Hs.26102 
104825 AA035613 Hs.141883 
107105 AW963419 Hs.155223 

25 108819 AA011449 Hs.271627 
112287 AB033064 Hs.334806 
112561 AI791493 
112637 R82331 
113206 BE262470 Hs.241471 

30 113970 W27249 Hs.8109 
114965 A1733881 
116925 N92293 
119905 AW449064 Hs.1 19571 
121611 M31669 Hs.1735 

35 129301 AF182277 Hs.330780 
133976 A1908165 Hs,169946 
134731 D89377 Hs.89404 
300254 AW183618 Hs.55610 
301884 AA312082 Hs.105445 

40 302001 AB020711 Hs.278346 
302067 BE542706 Hs.222399 
302276 AW057736 Hs.323910 
302290 AA179949 Hs.175563 
302372 AL1 17406 Hs.200102 

45 302385 AJ224172 Hs.204096 
309177 A1951118 Hs.326736 
309583 AW1 70035 Hs.326736 
310781 AI380797 Hs.158992 
311166 A1821005 Hs.1 18599 
50 311935 AA216387 

312153 BE2B1944 Hs.118625 
313328 AW44g211 Hs.105445 
313915 C18853 Hs.163443 
314097 AA648744 Hs.269493 
55 314138 AA740616 

314506 AA833655 H5^6868 
314558 A1873274 Hs.190721 
314691 AW207206 Hs.1363l9 
315006 AI538613 Hs.298241 
60 315021 AA533447 Hs.312989 
315051 AW292425 Hs.163484 
315060 AA551104 Hs,189048 



interferon-stimulated protein, 15 kDa 5.3 

cytochrome P450, subfamily IIB (phenobar 9 

carboxypeptidase 91 (tissue) 12 

cellular retinoic acid-binding protein 2 6.5 . 

tyrosii'^e aminotransferase 12.4 

opposite strand to trichortiinophalangeal 7.6 

ESTs 6,9 

stannbcaldn 2 5.3 

ESTs 6.1 

KIAA1238 protein 7.3 

ESTs. Weaidy similar to A36036 cytocfirom 8.2 

ESTs 5.4 

RIMB6 6.2 

hypothetical protein FU21080 6.9 

BMP-R1B 10.1 

ESTs. Moderately similar to ALU8„HUMAN A 1 9.7 

collagen, type 111, alpha 1 (Ehlers-OanI 8.4 

inhibin, beta B (activin AS beta polypep 5.6 

cytochrome P450, subfamily IIB (phenobar 6.2 

GATA-binding protein 3 (T-cell receptor 6.2 

msh (Drosophila) homeo box homolog 2 5.8 

solute canier family 30 (zinc transport 9.9 

GDNF family receptor alpha 1 5.7 

KIAA0904 protein 7.7 

CEGP1 protein 7.3 

HER2 receptor tyrosine kinase (c-erb-b2, 5.4 
Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1 

ATP-binding cassette transporter MRP8 6.7 

lipophilin B (uteroglobin family member) 1 3,8 

Homo sapiens breast cancer antigen NY-BR 17,3 

Homo sapiens breast cancer antigen NY-BR 57.6 

ESTs 10^ 

ESTs 10.8 
gb:nc16b02.s1 NCLCGAP_Fn Homo sapiens 5.2 

hexokinase 1 5.2 
GDNF family receptor alpha 1 .12.4 

Homo sapiens cDNAFU11576ns. done HE 26.3 

ESTs 6,6 
gb:ny97f11.s1 NCLCGAP.GCB1 Homo sapiens 5.9 

Homo sapiens cONA FU14056 fis. done HE 8.5 

ESTs • 27.4 

ESTs 207 

Transmembrane protease, serine 3 10.9 

ESTs 5.3 

ESTs 12.9 

ESTs, Moderately similar to ALUC.HUMAN ! 5.8 

279 



RNSCIOCTD: <WO 



wo 02/059377 



PCT/US02/02242 



315196 A1367347 Hs.44898 Homo sapiens done TCCCTA001 51 mRNAsequ 8.2 

315530 AW015415 Hs.127780 ESTs 8.9 

315634 AA837085 Hs.220585 ESTs 6.3 

316012 AA764950 Hs.1 19898 ESTs 7 

5 316177 AI904982 Hs.293102 ESTs. Moderatety similar to ALU1_HUMAN A 30.7 

316580 AA938198 Hs.146123 poty(A) polymerase gamma 9.4 

317803 AW664964 Hs. 128899 ESTs 6.1 

317881 A1827248 Hs.224398 Homo sapiens cDNA FU 11469 fis, clone HE 9.6 

318740 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 7.3 

10 318744 AI793124 Hs.144479 ESTs 17.8 

320211 AL039402 Hs.125783 DEME-6 protein 9.2 

321107 AI732643 Hs.144151 ESTs 12.3 

321644 AW975944 Hs.237396 ESTs 11.7 

321978 N77342 Hs.21851 Homo sapiens cDNA FU 12900 fis. clone NT 5 

15 322035 AL137517 Hs.334473 hypothetical protein DKFZp55401278 19 

322766 AW068805 H5.288467 Homo sapiens cDNA FU1 2260 fis. done m 5.2 

322818 AW043782 Hs.293616 ESTs 7.6 

322975 C16391 gb:C16391 Clontech human aorta poIyA mRN 16.5 

323262 AU33990 Hs.190642 ESTs 10.5 

20 323332 AI829520 gbMl19c06.x1 NCLCGAP_Ut1 Homo sapiens 6.2 

323817 AA410943 BMP-RIB 8.4 

324261 BE069341 gb:QV3.BT0381-270100^73-c08BT0381 Homo 49.4 

324432 AA464510 Hs.152812 ESTs 16.5 

324598 AW972227 Hs.163986 Homo sapiens cDNA: FU22765 Tb. done K 5 

25 ^ 324603 AW993522 Hs.292934 ESTs 10.4 

324987 AI375572 Hs.172634 ESTs 18.8 

325544 Phase 2 & 3 Exons 5.7 

330388 Hs.46 HER2 receptor tyrosine kinase (c-ert>-b2. 6.6 

334223 - NM_005080*:Homo sapiens X-box binding pr 26.2 

30 335809 NM_0145D9*;Homo sapiens kraken-Iike (BK1 10.1 

335824 ENSP00000249072*:DJ222E13.1 (N-TERMINAL 20 

AI052047 ESTs; Weakly similar to CYTOCHROME P450 6.7 

R72427 CH22_EM:AC0055dO.GENSCAN,127 9 5.5 

CH22_FGENES.619J3 9.2 

35 CH22^FGENES.617_9 8 

CH22^FGENES.271_7 6.5 

CH22_FGENES.619,7 7.3 

CH22_FGENES.271_8 12.9 

CH22_FGENES.619_12 8.4 

40 CH22_EM:AC005500.GENSCAN.304 2 11.3 

CH.07_hsgl(6004473 8.5 

CH22_FGENES.617_7 7.1 

(>i22J=GENES,678_5 7,2 

CH22.FGENES.678.5 16.8 
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TABLE 18A 
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15 



Table 18A shows the accession numbers for those pkeys lacking unigenelD's for Table 1 8. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and niRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" colimm. 



Pkey : U nique Eos probesel identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbeiB 



Pkey CAT number Accession 



323332 179142_1 AI829520 AI791832 AA228414 AI791823 AA22921 1 AA229315 

20 322975 1510563J C16391 C16413 

324261 273265 1 BE069341 AW748403 AL044891 AI908240 AA393080 

323817 233566 1 AA410943 AW948953 AA334202AA332882 

311935 174129J AA216387 T63548AA228676 

314138 179980 1 AA740616 AA654854 AA229923 

25 335809 CH22_3i81FG.617_6.UNK^EM 

335824 CH2a.3197FG.619„11_LINK_E . 

32554^ c12_hs 

334223 CH22J507F6;.360.4.UNK_EM 
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TABLE 18B 

Table 18B shows the genomic positioning for those pkeys lacking unigene ID*s and 
accession numbers in Table 18. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham 1. et a!." refers to the publication 

entitled *The ONA sequence of human chromosome 22.** Dunham 1. et al.. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

Ntj>osition: tndi(^tes nucleotide positions of predicted exons. 



Pkey Ref 



Strand Nt_position 



335809 Dunham, I. elai. Plus 

20 335824 Dunham, 1. eLal. Plus 

334223 Dunham. I. elal. Minus 

325544 6682452 Plus 



26310772-26310909 
26376860-26376942 
12734365-12734269 
171228-171286 
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TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES 

5 Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues. 
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such 
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or 
equal to 2.5. The "average" breast cancer level was set to the 90* percentile value. The 
"average" normal adult tissue level was set to the 90* percentile value amongst 144 non- 
10 malignant tissues. In order to remove gene-specific background levels of non-specific 

hybridization, the 15"' percentile value amongst the 144 non-malignant tissues was subtracted 
from both the numerator and the denominator before the ratio was evaluated. 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Pke/: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Tiile: Unigene gene title 

R1 : Ratio of tumor to nonnal body tissue 



Pkey ExAccn UnigenelD UnigeneTKIe 



R1 



408591 AF015224 
406964 M21305 

400291 AA401369 
407277 AW17C035 
449746 AI668594 
426878 BE069341 

400292 AA250737 
427585 D31152 
408045 AW138959 
407178 AA195651 
407377 C16391 
450705 U90304 
407212 AA412108 
428848 NM_000230 
404551 

407980 AA046309 
447350 AI375572 
450375 AA009647 
422109 S73265 
435496 AW840171 
453160 A1263307 
420813 X51501 
415989 AI267700 
422505 AL120862 
424399 Aig05687 
423575 C18863 
429441 AJ224172 
431474 AL133990 
448595 AB014544 
427217 AA399272 
402578 

422805 AA436989 
424534 NM_003613 
456207 AA193450 
424085 A1351010 
459587 AA031956 



Hs.46452 mammaglobin 1 137.6 

gb:Human atptia satelfite and satellite 3 71.0 
Hs.190721 ESTs .68.4 

Hs.326736 Homo sapiens breast cancer antigen NY-BR 54.2 
Hs.176588 ESTs, Weakly similar to CP4Y^HUMAN CYTOC 46.4 

gb:QV3-BT0381-2701004}73-c08BT0381 Homo 44.8 

Hs.72472 BMP-RIB 37.4 

Hs.179729 collagen, type X. alpha 1 (Sdimid metaph 32.9 

Hs.245123 ESTs 31.9 

Hs.104106 ESTs 30.4 

gb:C16391 Clontech human aorta polyA mRN 27.7 

Hs.25351 iroquois homeobox protein 5 24.8 

Hs.269350 ESTs 22.0 

Hs.194236 teptin (murine obesity homolog) 21.9 

tnchortiinophatangeal syndrome I (TRPS1 ) 21 .8 

gb:zf12f01.s1 SoaresJetaLheart,NbHH19W 19.8 

Hs.172634 ESTs 17.3 

Hs.8850 a disintegrin and metailoproteinase doma 16.6 

Hs.1473 gastrin-reieasing peptide 16.5 

Hs.265398 ESTs. Weakly similar to transfomnation-r 16.0 

Hs.239884 H2B histone family, member L 1S.8 

Hs.99949 prdadin-induced protein 15.8 

Hs.317584 ESTs 15.5 

H5.124165 programmed cell death 9 (PPCD9) 14.8 

Hs.2533 aldehyde dehydrogenase 9 family, member 1 4.5 

Hs.163443 Homo sapiens cDNA FU11576 fis. done HE 137 

Hs.204096 llpophiTin B (uteroglobin family member) 1 3.6 

Hs.190642 ESTs 13.5 

Hs.21572 KIAA0644 gene product 1 3.0 

Hs.144341 ESTs 12.8 

C1001 134:gq21 173721pirI1165981 fatty ac 12.6 

Hs. 121017 H2A histone family, memberA 12.2 

Hs.151407 car^ge intermediate layer protein, nu 12.0 

gb:zr40e07.r1 Soares^NhHMPu^SI Homosapi 11.9 

Hs.102267 lysyl oxidase 119 

gb:zk15e04.s1 SoaresjjregnanLutenis.NbH 1 1 .5 
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416208 AW291168 Hs.41295 ESTs. Weakly stm«ar to MUC2^HUMAN MUCIN 11.5 

429170 NM_001394 Hs.2359 dual specifidly phosphatase 4 11.5 

407276 AI951118 Hs.326736 Homo sapiens brBast cancer antigen NY-BR 11.4 

434377 AW137148 Hs.305593 Homo sapiens cDNA FU11382 fis. ctone HE 11.3 

448390 AL035414 Hs.21068 hypothetical protein 11.1 

452401 NM_007115 Hs.29352 tumornecrosis factor, alpha-induced pro 11.0 

421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs.108106 transcription factor 10.7 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 10.6 

421155 H87879 Hs.102267 lysyl oxidase 10.5 

402606 NM_024626:Homo sapiens hypothetical prot 10.4 

409269 AA576953 Hs.22972 hypothetical protein FU 13352 10.4 

447268 AI370413 Hs.36563 hypothetical protein FU224 18 10.3 

447033 AI357412 Hs.l57601 ESTs 10.2 

400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NM^002497 Hs.153704 N IMA (never in mitosis gene a)-related k 10.1 

432441 AW292425 Hs.163484 ESTs 9.9 

427365 AI873274 Hs.1 90721 ESTs 9.9 

438950 H23789 Hs,1 44530 EST 9.8 

422835 BE218705 Hs.121378 metaltothionein-like 5, testis-specific 97 

425692 090041 Hs. 155956 N-acetyltransferase 1 (ary)amine N-acety 97 

411869 W20027 Hs.23439 ESTs 9.6 

439820 AL360204 Hs!283853 Homo sapiens mRNA full length insert cDN 9^6 

445730 AJ624342 Hs.170042 ESTs 9.5 

459583 AI907673 gb:IL-BT152-080399-004 BT152 Homosapien 9.3 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matrilin3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401 278 9.1 

449448 D60730 Hs,57471 ESTs 9.1 

423945 AA410943 gb:zt32h03.r1 Scares ovary tumor NbHOT H 9.1 

406348 Target Exon 9.0 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs.32964 SRY (sex delemnining region Y)-box 11 9.0 

433365 AF026944 Hs.293797 ESTs 8.8 

405654 NA C12001521:gi|7513934|pir||T31081 cca3 pr 8.8 

418601 AA279490 Hs.86368 calmegin 8.8 

451110 AI955040 Hs,265398 ESTs, Weakly similar to transfomfiation-r 87 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50081 KIAA11 99 protein 8.4 

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 8.3 

451561 N52812 Hs.177403 ESTs 8.2 

424001 W67883 Hs.1 37476 paternally expressed 10 8.2 

429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.1 

405095 NA Target Exon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA. partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier family 1 (glial high affi 8.0 

416747 AW876523 Hs.15929 hypothetical protein FU 12910 8.0 

415385 R17798 Hs7S35 COBW-like protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cDNA: FU23523 lis, done L 7.9 

420931 AF044197 Hs.100431 smaU indudbte cytokine B subfamily (Cy 7.9 

406687 IWI31 126 Hs.272620 matrU metaltoproteinase 1 1 (MMP1 1 ; stro 7.8 

400285 NA Eos Control 7.7 

437207 T27503 Hs.15929 hypotheUcal protein FU 12910 7.6 

427119 AW880562 Hs.1 14574 ESTs 7.5 

429534 AW976987 Hs.163327 ESTs. Weakly similar to 21 09260ABceU 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

411078 AI222b20 Hs.182364 CocoaCrisp 7.4 

426214 H59846 Hs.1 28355 ESTs. Moderately similar to ALU7_HUMAN A 7.4 

447475 AI380797 Hs.158992 ESTs 7.3 

415263 AA948033 Hs.1 30853 ESTs 7.2 

439569 AW602166 Hs.222399 CEGP1 protein 7.2 

414142 AW368397 Hs.150042 Homo sapiens cONA FU1 4438 fis. done HE 7.1 

426261 AW242243 Hs.168670 peroxisomal famesylated protein 7.0 

444783 ■ AK001468 Hs.62180 anillin (Drosophila Scraps homotog), act 6^ 

445885 AI734009 Hs.127699 KJAAl 603 protein 6.9 
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429432 A1678059 Hs.202676 synaptonemal complex protein 2 6.9 

410781 AI375672 Hs.165028 ESTs 6.9 

443788 A1732643 Hs.144151 ESTs 6.9 

421373 AA808229 Hs.167771 ESTs 6.8 

5 451398 AI793124 Hs.144479 ESTs 6,8 

404253 NMJ)21058*:Homo sapiens H2B histone fami 6.8 

441098 AI015591 Hs.131004 ESTs, Weakly similar to T1 7227 hypolhefi 6.7 

426215 AW963419 Hs.155223 stanniocalcin2 6.6 

428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 6.6 

10 422956 BE545072 Hs. 122579 hypothetical protebi FU 10461 6.6 

411111 AW818127 gb:CM1-ST0277-061299-059-b07ST0277 Homo 6.6 

434988 At418055 Hs.161160 ESTs 6.6 

442580 AI733682 Hs.130239 ESTs 6,6. 

449611 AI970394 Hs.197075 ESTs 6.6 

15 408000 L1 1690 Hs.620 buHous pemphigoid antigen 1 (230/240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dihydfotestosteroner 6.5 

431089 BE041395 Hs.283676 ESTs. Wealdy similar to unlmown protein 6.5 

400301 X03635 .Hs.1657 estrogen receptor 1 6.5 

427356 AW0234a2 Hs.97849 ESTs 6.5 

20 425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 6.4 

441134 W29092 Hs.7678 cellular retinotc acid-binding protein 1 6.4 

424902 NM_003866 Hs.1 53687 inositol polyphosphate-4-phosphatase. ty 6.4 

448593 AW004854 Hs.228320 hypothetical protein FU23537 6.4 

'431448 AL137517 Hs.334473 hypothetical protein DKFZp56401 278 6.2 

25 444342 NM_014398 Hs.10887 similar to lysosome^ssociated membrane 6.1 

422168 AA586894 Hs.112408 SI 00 caldum-bindkig protein A7 (psorias 6.1 

453331 AI240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.d3169 matrix metalloproteinase 1 (MMP1; inters 6.0 

441233 AA972965 Hs.135568 ESTs 6.0. 

30 418092 R45154 Hs.106604 ESTs 6.0 

430044 AA464510 Hs.152812 ESTs - 5.9 

432837 AA310693 Hs.87329 HSPC072 protein 5.9 

433285 AW975944 Hs,237396 ESTs 5.9 

450701 H39960 Hs.288467 Homo sapiens cDNA FLJ 12280 fis, clone MA 5.9 

35 425707 AF1 15402 Hs.11713 E74-like factor 5 (ets domain transcript 5.9 

410785 AW803341 gb:IL2-UM0079^90300-050-D03 UM0079 Homo 5.9 

425398 AL049589 Hs.156369 hypothetical protein similar to tenascin 5.9 

414812 X72755 Hs.77367 monoltine induced by gamma interferon 5.8 

459371 R20991 gb:yg06h01.r1 Soares infant brain 1N1BH 5.8 

40 411284 N28519 Hs.135191 ESTs, Weakly simitar to unnamed protein 5.8 

453511 AIJ031224 Hs.33102 transcriptkm factor AP-2 beta (activati 5.8 

451807 W52854 Hs.27099 hypothetical protein FLJ23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 A(733881 Hs.72472 Bi\4P-R18 5.6 

45 438199 AW016531 Hs.122147 ESTs 5.6 

417866 AW067903 Hs.82772 coliagen. type XI. alpha 1 5.5 

430019 AA463893 Hs.220933 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FU23045 5.5 

423811 AW299598 Hs.50895 homeoboxC4 5.4 

50 434539 AW748078 Hs.214410 ESTs. Weakly similar to MUC2^HUMAN MUCIN 5.4 

439138 AI742605 Hs. 193696 ESTs 5.4 

453931 AL121278 Hs.25144 ESTs 5.4 

444078 BE246919 Hs.10290 U5 snRNP-specific 40 kDa protein (hPip8- 5.4 

447102 BE167434 Hs.98471 ESTs. Weakly similar to T1 8712 hypotheti 5.4 

55 451621 A1879148 Hs.26770 fatty acid binding protein 7. brain 5.4 

425236 AW057800 Hs.155223 stenniocaldn 2 5.3 

421464 AA291553 Hs.190086 ESTs 5.3 

450736 AW970060 gb:EST382140 MAGE resequences. MAGK Homo5.3 

428085 AA421081 Hs.1 2388 ESTs 5.3 

60 452838 U65011 Hs.30743 preferentially expressed antigen in mela 5.3 

445424 AB028945 Hs.12695 cortactin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422857 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2 

438167 R28363 Hs.24286 ESTs 5.2 

65 433330 AW2070a4 Hs.132816 hypothetKsal proteff) MGC14801 5.2 

449765 N92293 Hs.206832 ESTs. Moderately similar to ALU8_HUMAN A 5.2 

416276 U41060 Hs.79136 UV-1 protein, esbnogen regulated 5.2 
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400300 X03363 HER2 receptor tyrosine kinase (<>erb-b2, 5^ 

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs.193143 K(AA1069 protein 5.2 
5 455047 AW852530 g b:PM1-CT0243-07 1099-001 ^06 CT0243 Homo 5.2 

419169 AW851980 Hs.262346 ESTs. Weakly similar to S72482 hypothetl 5.2 

453197 A(916269 Hs.109057 ESTs. WeaWy similar to ALU5_HUMAN ALUS 5.1 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 5.1 

431023 A1283133 Hs.297420 ESTs 5.1 

10 427666 AI791495 Hs.1 80142 calmodulin-fike skin protein 5.1 

427718 AI798680 Hs.25933 ESTs 5,1 

434531 AA642007 Hs.1 16369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C2001837*:gi|126979031dbilBAB21770.1| (A 5.1 

15 452930 AW1 95285 Hs.1 94097 ESTs, Weakly similar to 138022 hypotheti 5.1 

444910 AI201849 gb:qs76g04j(1 NCI.CGAP_Pr28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

444381 BE387335 Hs.283713 ESTs. WeaWy similar to S64054 hypothetl 5.0 

450603 R43646 Hs. 12422 ESTs 5.0 

20 416575 W02414 Hs.38383 ESTs 5.0 

438504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.7g078. MAD2 (mitotic arrest deficient yeast h 5.0 

428804 AK000713 Hs.193736 hypothetteal protein FU20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

25 450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003029 .Hs.65792 syntrophin, gamma 2 4.9 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 A1820662 Hs.129598 ESTs 4.8 

30 435635 AF220050 Hs.1 81 385 uncharacterized hematopoietic stem/proge 4.8 

400286 NA C16000922:gll7499103|pirl|T20903 hypothe 4.8 

407505 U71 600 gb:Human zinc finger protein zfp31 (zf31 4.8 

420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

35 457726 A1217477 Hs.194591 ESTs 4.8 

412785 AW997556 Hs.78621 KIAA1717 protein 4.8 

428368 BE440042 Hs.83326 matrix metaHoiirDteinase 3 (stromelysin 4.7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

40 400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cDNA FU14035 fis. clone HE . 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channet calcium activated, fam 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

45 453616 NM.003462 Hs33846 dynein. axonemal. light intennediate po! 47 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW613731 Hs.159153 ESTs, Moderately simHar to S65657 alpha. 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799*:gi|6330365|dbj|BAA86508.1|(AB 4.6 

50 444649 AW207523 Hs.197628 ESTs 4.6 

429431 240313 Hs.106330 Homo sapiens done IMAGE:23371. mRNAseq 4.6 

427811 M81057 Hs.1 80884 carboxypeptidaseBI (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens. Similar to RIKEN cDNA 2010 4.6 

430345 AK000282 Hs,239681 hypothetical protein FU20275 4.6 

55 454307 AW855717 gb:RC1-CT0279-081299^13-b01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 UV-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451340 AW936273 gb:QV0-DT0O20-O9020O-107-g07 DT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

60 457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, clone 1MAGE:393701 5, mRN A. 4.6 

445537 AJ245671 Hs.12844 EGF-like-domain. muiUple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FU20086 4.5 

65 432374 W68815 Hs,301885 Homo sapiens cDNA FU 11 346 fis. done PL 4.5 

423833 AW503329 gb:UI-HF-BN0-akx.e-02-O-Ul.r1 NIK.MGC^50 4.5 

406747 AI925153 Hs.217493 annexinA2 4.5 
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412102 H56435 gb:yq98e09.r1 Soares fetalliver spleen 4,5 

431716 D89053 Hs.26fi012 fatty-add-Coenzyme A Ugase, long-chain 4.5 

41 1050 AW814902 gb:MR1-ST0206-120400^22-f08 ST0206 Homo 4.5 

401418 NA C14000338*:giI7459502|plr||S74665 cuter 4.5 

5 435194 AK001074 Hs.333435 Homo sapiens cONA FU10212 fis, done HE 4.5 

435211 AK0015B1 Hs.334828 hypothetical protein FU1071 9; KIAA1794 • 4.4 

414080 AA135257 Hs.47783 B aggressive lymphoma gene 4.4 

424115 AA335497 Hs.293965 ESTs. Weakly similar to 138022 hypolheti 4.4 

415786 AW419195 Hs.257924 hypothetical protein FLJ13782 4.4 

10 442117 AW664964 Hs.128899 ESTs 4.4 

452784 BE463857 Hs.151258 hypothetical protein FU21082 4.4 

432731 R31 178 Hs.287820 fibronecfini 4.4 

410534 AW905138 gb:QVO-N?n071-280400-207-g07 NN1071 Homo4.4 

405196 NA C2000662*:giI7512792|pir||T12482hypothe 4.4 

15 430217 N47863 Hs.336901 ribosomal protein S24 4.4 

401793 C17001545:gi|5360127|gb|AAD42882.11AF155 4.4 

415747 AA381209 gb:EST94257 Activated T-cells I Homo sap 4.4 

423679 AB007975 Hs.l31454 KIAA0506 protein 4.4 

400238 NA C19000274*:gi|127413271reqXP^008833.21 4.4 

20 425627 AF019512 Hs.297007 membrane-bound transcription factor prot 4.4 

400608 C1000l899:gi|7508633lpirliT25392 hypothe 4.4 

458634 AV657310 Hs.282898 ESTs 4.3 

407771 AL138272 Hs.62713 ESTs 4.3 

405906 NA Target Exon 4.3 

25 405925 NA Target Exon 4.3 

439382 BE247684 Hs.1O307O ESTs 4.3 

445263 H57646 Hs.42586 K1AA1 560 protein 4.3 

407162 N63855 Hs.142634 anc finger protein 4.3 

419536 AA603305 gb:np12d11.s1 NCl_CGAP_Pr3 Homo sapiens 4.3 

30 454359 N71277 gb:za36e03.s1 Scares fetalliver spleen 4.3 . 

411558 AA102670 Hs.70725 gamma-aminobutyric add (GABA) A recepto 4.2 

450715 A1266484 Hs.31570 ESTs. Weakly similar to KIAAI 324 proteh 4.2 

421451 AA291377 Hs,50831 ESTs 4.2 

452864 AA033714 Hs.287629 hypothetical protem FU14260 4.2 

3 5 409757 NM_001898 Hs.1231 14 cystatin SN 4.2 

413043 BE158766 gb:IL2-HT0397-071299-024-F02 HT0397 Homo 4.2 

413499 BE144884 gb:CM0-HT0182-041 099-065^1 l.HT0182Homo4.2 

444619 BE538082 Hs.d172 ESTs, Moderately similar to A46010X-lin 4.2 

408380 AF123050 Hs.44532 diubiquitin 4.2 

40 406992 S82472 gb:beta -pol^DNA polymerase beta (exon a 4.2 

404285 NA C6001909;gi|704441|dbj|BAA18909.1|{D298 4.2 

425247 NM 005340 Hs.155324 matrix metatloproteinase 11 (MMPll;stro 4.2 

428046 AW812795 Hs.155381 ESTs. Moderately sImUar to 138022 hypot 4.2 

446163 AA026880 Hs,25252 prolactin receptor 4.2 

45 421147 AW592167 Hs.293299 ESTs 4.2 

426451 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 4.2 

415227 AW821113 Hs.72402 ESTs 4.2 

452176 AA024538 Hs.282990 Human DNA sequence from done RP1-28H20 4.2 

452862 AW378065 Hs.8687 ESTs 4.2 

50 443646 AIQ85198 Hs.1 64226 ESTs 4.2 

425523 AB007948 Hs.1 58244 KIAA0479 protein 4.1 

424687 J05070 Hs.1 51738 matrix metalloproteinase 9 (gelatinase B 4.1 

430009 AA894564 Hs.22242 ESTs 4.1 

434469 AA634806 gb:ab28c02r1 Slratagene lung (937210) H 4.1 

5 5 451381 BE241831 Hs.172330 hypothetical protein MGC2705 4.1 

450229 R18717 Hs.8929 hypothetical protein FLJ1 1362 4.1 

455700 BE0681 15 gb:CM1-BT0368-06 1299-060^07 BT0368 Homo4.1 

431924 AK000850 Hs.272203 Homo sapiens cDNA FLJ20843 fis. done AD 4.1 

438885 AI886558 Hs.184987 ESTs 4.1 

60 401451 NI\A_004496*:Homo sapiens hepatocyte nude 4.1 

431676 AI685464 gb*88f04j(1 NCLCGAP.Pi28 Homo sapiens 4.1 

409092 A1735283 Hs.172608 ESTs 4.1 

429270 W60379 Hs.57773 ESTs 4.1 

443903 AI220547 Hs.135223 ESTs • 4.1 

65 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase {o«b4)2. 4.1 

410275 U85658 HS.6179S transcription factor AP-2 gamma (activat 4.1 

432912 BE007371 Hs.200313 ESTs 4.1 
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403585 Target Exon 4.1 

438295 AI394151 Hs.37932 ESTs 4.1 

420380 AA640891 Hs.102405 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase Vlll 4.1 

5 416182 NM_004354. Hs.79069 cyc!inG2 4.1 

418994 AA298520 Ks.89546 selectin E (endothelial adhesion motecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 g!ycogenin2 4.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 4.0 

10 427131 .AA448460 Hs.112017 GE36gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 4.0 

429353 AL1 17406 Hs.200102 ATP-binding cassette transporter MRP8 4.0 

421296 NIVL002666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

15 424188 AW954552 Hs.142634 zinc finger protein 4.0 



455431 AW938484 gb:CM0-OT0057-290200-253Kl06 DT0057 Homo4.0 



404142 NA Target Exon 4.0 

441143 AI027604 Hs,1 59650 ESTs 4.0 

444540 AI693927 Hs.265165 ESTs 4.0 

20 415579 AA165232 Hs.222069 ESTs 4.0 

452891 N75582 Hs.212875 ESTs. Weakly similar to DYH9_HUMAN CILIA 4.0 
414605 BE390440 gb:601283601F1 NIH.MGC 44 Homo sapiens c 4.0 

452281 T93500 Hs.28792 Homo sapiens cDNA FU1 1041 fis. clone PL 4.0 

417801 AA417383 Hs.82582 integrin. beta-like 1 (with EGF-Bke rep 4.0 

25 446232 At281848 Hs.194691 retinoic acid induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP.2 alpha (activat 4.0 

437854 AL119723 gb:DKFZp761A2124„r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FLJ21 908 4.0 

462240 AI591147 Hs.61232 ESTs 4.0 

30 459574 A1741122 Hs.101810 Homo sapiens cDNA FLJ 14232 fis. clone NT 4.0 

458673 N99626 gb:za39d1 1 .r1 Scares fetai liver spleen 4.0 

444858 A1199738 Hs.208275 ESTs. Weakly similar to ALUA.HUMAN i!!! 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF153330 Hs.30246 solute canrier family 19 (thiamine trans 3.9 

35 450192 AA263143 Hs.24596 RAD51 -interacting protein 3.9 

405554 NA TaigetExon 3.9 

416259 AA573005 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanlne-glyoxylate aminotransferase 2-li 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_totaLfetus_Nb2HF8^ 3.9 

40 413930 M86153 Hs.75618 RAB1 1 A. member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs,328142 ESTs 3.9 

452564 AA026777 gb:ze93c11.r1 SoaresJetaLhearLNbHH19W 3.9 

442500 AI819068 Hs^122 ESTs 3.9 

45 419759 Z21336 Hs.135411 actin related protein 3.9 

424638 AI472106 Hs.49303 Homo sapiens cDNA FLJ1 1663 fis, ctone HE 3.9 

439699 AF086534 Hs.187561 ESTs. Moderately similar to ALU 1 .HUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1. homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FU10980 3.9 

50 453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FLJ22097 fis, ctone H 3.9 

443489 AI073512 Hs.133916 ESTs 3.9 
455092 BE152428 gb:CM0-HT0323-151299-126-b04 HT0323 Homo3.9 

401785 NMJ02275*:Homo sapiens keratin 15 (KRT1 3.9 

55 426427 M86699 Hs.169840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs ' 3.9 

436033 H75391 Hs.255748 ESTs 3.9 
451067 BE172186 gb:MRO-HT0559-110300-005-h11 HT0559 Honio3.8 

419348 AA236545 Hs.98274 ESTs 3.8 

60 444635 AI184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-Iike (rabkines . 3.8 

403593 Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC done RP5-1087M19 from 3.8 

65 433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FU2241 8 3.8 
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443162 
458194 
422475 
440705 
447290 
403426 
427821 
454288 
443801 
410658 
410672 
428579 
445495 
447995 
401747 
420633 
423545 
•433138 
434715 
428664 
450951 
402696 
446868 
458154 
422026 
419440 
421524 
417283 
401508 
410303 
420362 
433384 
434302 
443938 
448420 
458712 
433404 
405232 
430491 
455609 
450164 
453948 
436061 
401049 
418867 
420179 
458663 
437259 
428309 
450522 
451952 
412209 
425201 
443830 
439255 
414869 
409064 
407721 
445135 
404091 
409731 
405153 
423248 
403639 
404360 
422352 
423338 



T49951 
AW383618 
AL359938 
AA904244 
AI476732 

AA470158 
BE222648 
AW206942 
AW1 05231 
AW794600 
NM_005756 
BE622641 
AI742618 

NMJ)14581 
AP000692 
AB029496 
BE005346 
AK001666 
AA018534 
NA 

AV660737 
AW816379 
U80736 
AB020689 
AA312082 
N62840 
NA 

AA324597 
U79734 
A1021992 
AA629055 
R55373 
BE623004 
AI347502 
T32982 

AL109791 
BE011668 
A1239923 
AI970797 
AI248584 
NA 

D31771 
N74530 
AV658444 
A1377755 
M97815 
At6g8839 
AL120173 
AW901456 
AA352111 
AI142095 
BE164500 
AA157291 
AA062954 
Y12735 
AK000054 
NA 

AA125985 

AA380177 
NA 

AA766296 
AB007981 



Hs.9029 DKRP434G032 protein 3.8 

Hs.265459 ESTs, Moderately simffar to ALU2^HUMAN A 3.8 

Hs.117313 Meis(mouse)homolo93 3.8 

Hs.153205 ESTs 3.8 

Hs.263912 ESTs 3.8 

Target Exon 3.8 

Hs.98202 ESTs 3.8 

Hs.279458 ESTs. Highly similar to c38QA1.lbIH.sap ' 3.8 

Hs.253594 ESTs ' 3.8 

Hs.192035 ESTs 3.8 
gb:RC6-UM0014-170300-O22-CO5 UM0014 Homo3.8 

Hs.184942 G protein-coupled receptor 64 3.8 

Hs.3848g ESTs. Weakly sMar to I38Q22 hypotheti 3.8 

Hs.181733 ESTs. Weak^ similar to nitrilasehomolo 3.7 

Homo sapiens keratin 17 (KRT17) 3.7 

Hs.274480 odorant4)inding protein 2A 3.7 

Hs.1 29781 chromosome 21 open reading frame 5 3.7 

Hs.59729 semaphorin sem2 3.7 

Hs.1 16410 ESTs 3.7 

Hs.189095 similar to SALL1 (sal (DrosophiTaHike 3.7 

Hs.103334 ESTs ^ 3.7 
C3002523:gi|6686211|sp|Q27533|YH2WLCAEEL3.7 

Hs.135100 ESTs 3.7 

Hs.335018 ESTs 3,7 

Hs.110826 trinucleotide repeat containing 9 3.7 

Hs,90419 KIAA0882 protein 3.7 

Hs.105445 GDNF family receptor alpha 1 3.7 

Hs.48648 ESTs 3.7 

NM_024817:Homo sapiens hypothetical prot 3.7 

Hs.21851 Homo sapiens cONA FU12900 fis, done NT 3.7 

Hs.97206 huntingtin interacting protein 1 3.7 

Hs.124244 ESTs ' ■ 3.7 

Hs.1 16301 ESTs ' 3.7 

Hs.20864 ESTs 3.7 
gb:601441282F1 NIH_MGC_72 Homo sapiens c 3.7 

Hs.107872 hypothetical protebiFLJ20761 3.7 

Hs.102720 ESTs 3.7 

NM,015832:Homo sapiens methyl-CpG bindin 3.7 

Hs.241559 Homo sapiens mRNA full length insert cDN 3.7 
gb:CM3-BN0223-100500.177-a04 BN0223 Homo3.7 

Hs.30098 ESTs 3.7 

Hs.64859 ESTs 3.7 

Hs.190745 Homo sapiens cONA: FU21326 fis. done C 3.7 

TaigetExon 3.6 

Hs.89404 msh(DfOSophfe)homeoboxhomolog2 3.6 

Hs.21168 ESTs . 3.6 

Hs.280776 tankyrase; TRFI-interading ankyrin-iela 3.6 
Hs-120695 ESTs 3.6 
Hs.183650 cellular retinoicadd-binding protein 2 3.6 
gb:wd31102j(1 Soares_NFl_T„GBC.S1 Homo s3.6 
Hs.301663 ESTs 3.6 
gb:RC0-NN1012-270300-031-c07 NN1012 Homp3,6 
gb:EST60061 Activated T-cells XX Homo sa 3.6 
Hs.143273 ESTs 3.6 
gb:RC4-HT0469-230300-014-e10 HT0469 Homo 3.6 
Hs.21479 ubinudeini 3.6 
Hs.141883 ESTs 3.5 
Hs.380i8 dual-spedfidty tyrosine-(Y)-phosphoryl 3.(^ 
Hs.12347 hypothetical protein FU20047 3.6 
TaigetExon 3.6 
Hs.56145. thymosin, beta, Wenfified in neuroblast • 3.6. 

TaigetExon 3.6 
Hs.125845 ribulos&6-phosphate-3^lnierase 3.6 
ENSP0000Q233023*:CDNA FU12662 lis, don 3.6 
C7001385:gi|12082809|gb|AAG48618.1|AF315 3.6 
Hs.99200 ESTs 3.6 
Hs.127338 KIAA0492 protein 3.6 

289 



RMjEvinnirv «wn ososssttaz' r > 



wo 02/059377 PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



424202 
431750 
439907 
453596 
406446 
418454 
434360 
409079 
440132 
448706 
440671 
407647 
459023 
402820 
417009 
447048 
449978 
428062 
452909 
400610 
417843 
419335 
451592 
443270 
423948 
449424 
423841 
416806 
422060 
433104 
415778 
413054 
416636 
424639 
424827 
437782 
411514 
413783 
421106 
431291 
440623 
455838 
458771 
442942 
436550 
418649 
424420 
430916 
432030 
439405 
405917 
452727 
426320 
421070 
424625 
428508 
455651 
410555 
447754 
418636 
404097 
434205 
421072 
402421 
405248 
407638 
403000 



BE350295 
AA514986 
AA853978 
AA441838 
NA 

AA315308 

AW015415 

W87707 

AI697121 

AW291095 

AW297920 

AW850158 

AW968226 

AA191719 

AW393080 

Ald06335 

AA420683 

NM-015368 

NA 

W07361 . 

AW960146 

AI805416 

NM 004272 

AW392342 

AW448937 

AW753967 

N(VL000288 

R20893 

AL043002 

H84847 

AW316843 

N32536 

AI917494 

AI057094 

A1370876 

AW850178 

AA314337 

AA877124 

N25521 

A1935016 

BE145808 

AW295151 

AW167087 

Z50158 

AW474547 

BE614743 

AW505021 

Ai908400 

AF086224 

NA 

AW993582 

W47595 

AA283185 

AW904466 

BE252383 

BE064962 

U92649 

AW073310 

AW749855 

NA 

AF119861 
AI215069 
NA 
NA 

AJ404672 
6E247275 



Hs.15032 RAN binding protein 17 3.6 

Hs.283705 ESTs 3.6 

Hs.124577 ESTs 3.6 

Hs.62905 hypothetical protein nj14834 3.6 

Target Exon 3.6 

Hs.195870 hypothetical protein RJ14991 3.6 

Hs.127780 ESTs 3.6 

H5.82065 interleukin 6 signal transducer (gp130. 3.6 

Hs.202466 ESTs,- WeaWy similar to S65824 reverse t 3.6 

Hs.21814 interleukin 20 receptor, alpha 3.6 

Hs.130054 ESTs 3.5 
gb:RC0-CT0379-290100«2.b04 CT0379 Homo 3.5 

Hs.60798 ESTs 3.5 

NM_017646*:Homo sapiens tRNA isopentenyl 3.5 

Hs.314714 ESTs 3.5 

Hs.228320 hypothetical protein FU23537 3.5 

Hs.200829 ESTs. Weakly similar to T30171 ninein - 3.5 

Hs.98321 hypothetical protein FU14103 3.5 

Hs.30985 pannexini 3.5 

Target Exon 3.5 

Hs.22545 Homo sapiens cDNA FU1 2935 fis. clone NT 3.5 

Hs.284137 hypothetical protein FU12688 3.5 

Hs.213897 ESTs 3.5 

Hs.337737 Homer, neuronal immediate early gene» IB 3.5 

Hs.2d3077 centrosomal P4.1-assocfated protein; unc 3.5 

Hs.197030 ESTs 3.5 
gb:RC2-CT0304-080100-011-h12 CT0304 Homo 3.5 

Hs J9993 peroxisomal biogenesis factor 7 3.5 

Hs.325823 ESTs, Moderately similar to ALUS^HUMAN A 3.5 

Hs.128246 ESTs, Moderately similar to unnamed prot 3.5 

Hs.49391 hypothetical protein LOC54149 3.5 

Hs.66309 hypothetical protein MGC11061 3.5 

Hs.42645 solute carrier family 16 (monocarboxylic 3.5 

Hs.9812 Homo sapiens cONA FU 14388 fis. clone HE 3.5 

Hs.96867 Homo sapiens cDNA: FLJ231 55 fis, clone L 3.5 

Hs.79090 exportin 1 (CRM1 , yeast homolog) 3.5 

gb:(L3-CT0219-271099-022-H12CT0219 Homo 3.5 

Hs.301547 ribosomal protein S7 3.5 

Hs.172844 ESTs 3.5 

Hs.25275 Knjppel-type zinc finger protein 3.5 

Hs.216639 ESTs 3.5 
gb:MRO-HT0208-101299-103-f11 HT0208 Homo 3.5 

Hs.163612 ESTs 3.5 

Hs.131562 ESTs 3.5 

Hs.270235 ESTs. Weakly similar to MMHUB1 larranin b 3.5 

Hs.53565 Homo sapiens PiG-M mRNA for mannosyltran 3.5 

Hs.146688 prostaglandin E synthase 3.5 

Hs.88414 BTB and CNC homology 1, basic leucine zi 3.5 

Hs.143789 ESTs 3.5 

Hs.55238 ESTs 3.5 

C17000575:giI7290703|gb|AAF46150.1| (AEO 3.5 

Hs.176220 ESJs 3.5 

Hs.169300 transfonnihg growth factor, beta 2 3.4 

Hs.19327 ESTs 3.4 

Hs,321 197 PDZ domain protein (Drosophila inaD-fike 3.4 

Hs.184668 SBBi31 protein 3.4 
gb:RC1-BT0313-130400-016-c02 BT0313 Homo 3,4 

Hs.6431 1 a disintegrin and metaUoproteinase doma 3.4 

Hs.163533 Homo sapiens CDNAFU14142 fis, done MA 3.4 
gb:QV4-BT0534-281299-053-c05 BT0534 Homo 3.4 

C5000242*:gi|9369379|gblAAF87128.1|AC006 3.4 

hypothetical prated PRO2015 3.4 

ESTs 3.4 

C1001578*:gi|5759903bb|AAF28099.1| (AFl 3.4 

Target Exon 3.4 

hypothetical protein FLJ23571 3.4 

U5 snRNP-spediic protein, 1 16 kD 3.4 
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433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prate 3.4 

432239 X81334 Hs.2936 matrix metaltoproteinase 13 (collagenase 3.4 

458747 BE618395 Hs,257391 hypothetical protein DKFZp761J1523 3.4 

442082 R41823 Hs7413 ESTs; calsyntenin-2 3.4 

5 417974 AA210765 gb:zf90c06.r1 NCLCGAP_GCB1 Homo sapiens 3.4 

446002 AI346468 Hs.145789 ESTs 3.4 

448995 Ai613276 Hs.5662 guanine nudeotide binding protein (6 pr 3.4 

436007 AI247716 Hs.232168 ESTs 3.4 

424698 AA164366 Hs.l51973 hypothetical protein FU23511 3.4 

10 435202 AI971313 Hs.1 70204 KIAA0551 protein 3.4 

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 3.3 

405460 NA TaigetExon 3.3 

441826 AW503603 Hs.129915 phosphotriesterase related 3.3 

453472 AL037925 gb:DKFZp564M037„r1 564 (synonym: hfbr2) 3.3 

15 447078 AW885727 Hs.301570 ESTs 3.3 

441690 R81733 Hs.33105 ESTs 3.3 

420092 AA814043 Hs.88045 ESTs 3.3 

418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.3 

408908 BE296227 Hs.250822 serine/threonine kinase 15 3.3 

20 414737 AI160386 Hs.125087 ESTs 3.3 

449650 AF055575 Hs.23838 caldum channel, voltage-dependent, L ty 3.3 

418912 NM 000685 Hs.89472 angiotensin receptor 1 3.3 

436405 AA1~60079 Hs.172932 Homo sapiens mRNA for partial 3'UTR, seq 3.3 

453911 AW5Q3857 Hs.4007 Sarcolemmai-associated protein 3.3 

25 409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 3.3 

429548 AW138a72 Hs.135288 ESTs * 3.3 

420807 AA280627 Hs.57846 ESTs 3.3 

409595 AA296g51 gb:EST112514 Adrenal gland tumor Homo sa 3.3 

445189 A1936450 Hs.147482 ESTs 3.3 

30 402892 NA TaigetExon 3,3 

426631 AA994896 Hs.22514 ESTs 3.3 

458722 AA741545 Hs.282832 ESTs. WeaWy similar to T24961 hypotheti 3.3 

409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 3.3 

443194 Ai954968 Hs.279009 matrix Gla protein 3.3 

35 445432 AV653771 gb J^V653771 GLC Homo sapiens cDNA done 3.3 

410908 AA121686 Hs.10592 ESTs 3.3 

406151 NA TaigetExon 3.3 

436461 AW511956 Hs.293261 ESTs 3.3 

411171 AW820260 gb:QV2-ST0296-150200-040-c10 ST0295 Homo 3.3 

40 432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.3 

439310 AF086120 Hs.102793 ESTs 3.3 

401575 NA TaigetExon 3.3 

420900 AL045633 Hs.44269 ESTs 3.3 

445628 AI344166 Hs.155743 ESTs 3.3 

45 . 448243 AW369771 Hs.52620 integrin. beta 8 3.3 

445102 AW204610 Hs.22270 ESTs 3.3 

442118 AA976718 Hs.202242 ESTs 3,3 

416783 AA206186 Hs.79889 monocyte to macrophage differentlation-a 3.3 

435039 AW043921 Hs.130526 ESTs 3.3 

50 451474 T70874 Hs.207636 ESTs 3.2 

442559 T10213 Hs.159993 gycosyltranslerase 3.2 

453921 AI824O09 Hs.44577 ESTs 3.2 

420036 R60336 Hs.52792 Homo sapiens mRNA; cDNADKFZp586l1 823 (f 3.2 

435627 W88774 Hs.1 18370 ESTs 3.2 

55 411598 BE336654 Hs.70937 H3 histone family, member A 3.2 

446733 AA863360 Hs.26040 ESTs. Weakly similar to fatty add omega 3.2 

410153 BE311926 Hs.l5830 hypothetical protein FU 12691 3.2 

403537 NA C3001106*:gl|10047201!dbi|BAB13394.1| (A 3.2 

405547 NM.018833*:Homo sapiens transporter 2. A 3.2 

60 427878 C05766 Hs.181022 CGI-07 protein 3.2 

451871 AI821005 Hs.1 18599 ESTs 3.2 

410313 R10305 Hs.185683 ESTs 3.2 

416856 N27833 Hs.269028 ESTs, Weakly simflar to 138022 hypotheti 3.2 

449490 AI652777 Hs.197069 ESTs 3.2 

65 450506 NM 004460 Hs.418 fibroblast activation protein, alpha 3.2 

440684 AI253123 Hs.127356 ESTs, Highly similar to S21424 nestin (H 3.2 

459055 N23235 Hs.30567 ESTs. Weakly similar to B34087 hypotheti 3.2 

291 



wo 02/059377 



452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mFlNA 3.2 

430965 AA489732 Hs.154918 ESTs 3.2 

405394 Target Exon 3.2 

424693 BE169810 Hs.47557 ESTs 3.2 

454265 H03556 Hs.30O949 ESTs. Weakly siirular to thyroid hoimone 3.2 

437687 AA765917 Hs.122840 ESTs 3,2 

428372 AK00a684 Hs.183887 hypothetical protein FU22104 3.2 

414083 AL121282 Hs.257786 ESTs 3.2 

411670 AW856552 gb:RC1-CT0294-O80100-012-a04 CT0294 Homo 3.2 

416283 NM_C05429 Hs.79141 vascular endothetial growth factor C 3.2 

437488 AA758239 Hs.180330 ESTs 3.2 

428398 AI249368 Hs.98558 ESTs 3.2 

452042 H38857 Hs.243901 Homo sapiens cDNA FU20738 fis. clone HE 3.2 

421477 AI904743 Hs.104650 hypothetical protein FU10292 3.2 

438078 AI016377 Hs.131693 ESTs 3.2 

448816 AB033052 Hs.22151 K1AA1226 protein 3.2 

419519 AI198719 Hs.176376 ESTs 3.2 

404580 NM_0141 12*:Homo salens trichorhinophala 3.2 

447046 AA326187 Hs.17170 G protein-coupled receptor 4 3.2 

457473 AW974903 Hs.291231 ESTs 3.1 

429838 AW904907 Hs.30732 hypothefical protein FU 1 3409; KIAA1711 3.1 

459702 AI204995 gb;an03c03.x1 Stralagene schizo brain SI 3.1 

400195 NA NM_C07057*:Homo sapiens ZWIOinteraclor 3.1 

.417860 AW408557 Hs.235498 hypothetical protein FU14075 3.1 

417995 AW974175 Hs.188751 ESTs. Weakly similar to MAPB^HUMAN MJCRO 3.1 

422589 AA312735 Hs.30512 Homo sapiens mRNA for KIAA0556 protein. 3.1 

435870 AA701 327 Hs.17949 ESTs 3.1 

440801 AA906366 Hs. 190535 ESTs 3.1 

426274 D38122 Hs.2007 tumor necrosis factor (ligand) superfami 3.1 

423728 AW891294 Hs. 132 136 solutecarrierfamily 4. sodium bicarbon 3.1 

439677 R82331 Hs.164599 ESTs 3.1 

452834 AI638627 Hs. 105685 ■ KIAA1688 protein 3.1 

431349 AA503653 Hs.156942 ESTs. Moderately similar to ALU2_HUMAN A 3.1 

417576 AA339449 Hs.82285 phosphoribosylglycinamide fonnyltransfer 3.1 

430264 AA470519 gb:nc71f10.s1 NCJ CGAP Pri Homo sapiens 3.1 

418827 BE327311 Hs.47166 HT021 3.1 

410835 AW806906 gb:QV4-ST0023-l6Q400-172-d12 ST0023 Homo 3.1 

426269 H15302 Hs.l68950 Homo sapiens mRNA; cDN A DKFZp566Al 046 (f 3.1 

405336 NA Target Exon 3.1 

437783 AI683150 Hs^1550 ESTs, Weakly similar to ALU INHUMAN ALU S 3.1 

440931 AI583052 Hs.270058 ESTs 3.1 

455945 BE160636 gb:PM1-HT0422-291299-002-c08HT0422 Homo 3.1 

430437 AI768801 Hs.169943 Homo sapiens cDNA FU13569 fis, clone PL 3.1 

405848 NA Target Exon 3.1 

455685 BE066976 gb:PM0-BT0340-211299-003-c12BT0340 Homo 3.1 

406970 M29994 gb:Human alpha-l spectrin gene, exon 12. 3,1 

409602 W26713 Hs.256972 ESTs 3.1 

423518 D45027 Hs.l29732 R3H domain (binds single-stranded nuclei 3.1 

425653 AI065104 Hs.249718 ESTs. Weakly similar to A46010 X-linked 3.1 

426326 BE165753 Hs.250528 Homo sapiens, done IMAGE:4098694. mRNA. 3.1 

433805 AA706910 Hs.1 12742 ESTs ' 3.1 

437152 AL050027 gb.'Homo sapiens mRNA; cONA DKFZp566C03243.1 

448602 AI541305 Hs.48778 niban protein 3.1 

452844 AW407181 Hs.218377 Homo sapiens cDNA FU 11 927 fis. done HE 3.1 

407366 AF026942 gbrHomo sapiens cig 33 mRNA, partial sequ 3.1 

408254 AW807227 gb:MR4-ST0062-1fl0200-001^10ST0062 Homo 3.1 

424085 NM_002914 Hs.139226 replicatton factor C (activator 1) 2 (40 3.1 

416790 R83066 Hs.7043 succinate-CoA ligase. GDP-fonning. alpha 3.1 

420020 BE295866 Hs.94382 adenosine kinase 3.1 

426119 W94997 Hs.189917 ESTs 3.1 

426968 U07616 Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1 

457421 AL117431 Hs.112165 Homo sapiens cDNAFU12198 fis, done MA 3.1 

453403 BE466639 Hs.61779 Homo sapiens CDNA FU1 3591 fis, done PL 3.1 

454141 AW138413 Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1 

426650 AA382814 gb:EST96097 Testis 1 Homo sapiens cDNA 5 3.1 

450865 AI248013 Hs. 106532 ESTs. Weakly sim»ar to 138588 reverse t 3.1 

407993 AW135274 Hs.12433 ESTs 3.1 
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446466 H38026 Hs.308 arrestin 3. retinal (X-arreslin) 3.1 

457888 BE219794 Hs.293471 ESTs 3.1 

420058 AK001423 Hs.94694 Homo sapiens cDNA FLJ10561 fis, done NT 3.0 

409248 AB033035 Hs.51965 KIAA1 209 protein 3.0 

5 452747 BE1 53855 Hs.61460 Ig superfamily receptor LNIR 3.0 

418926 AA232658 Hs.105794 UDP-^lucoseiglycoproteinglucosyltransfe 3.0 

419346 AI830417 Hs.44143 polybnomol 3.0 

429826 N93266 Hs.40747 ESTs 3.0 

435147 AL133731 Hs.4774 Homo sapiens niiRNA:cDNADKFZp761C171 2 (f 3.0 

10 420139 NM_005357 Hs.95351 lipase, homione-sensitive 3.0 

405609 NA ENSP00000241065*:CDNA 3.0 

404274 NIVL002944*:Homo sapiens v-ros avian UR2 3.0 

449777 AI971362 Hs.231945 ESTs 3.0 

415459 H07118 Hs.6099 ESTs 3.0 

15 415245 N59650 Hs.27252 ESTs 3.0 

406291 NA Target Exon 3.0 

414210 BE383592 gb:601297871F1 NIH_MGC_19 Homo sapiens c 3.0 

432055 AW972359 Hs.293334 ESTs 3.0 

442246 AI79198B Hs,129115 ESTs 3.0 

20 451353 N21043 Hs.42932 ESTs 3.0 

451177 AI969716 Hs.13034 ESTs 3.0 

418026 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 3.0 

401326 NA C10000447*:gil1l683751sp|P43467|AGA1^PED 3.0 

409920 BE169746 Hs.12504 likely ortholog of mouse Arkadra 3.0 

25 432887 AI926047 Hs.162859 ESTs 3.0 

411789 AF245505 Hs.72157 DKFZP564I1922 protein 3.0 

401045 CI 1001883*:gil6753278|refINP_033938.1| c 3.0 

432162 AA684062 Hs.272798 hypothetical protein FU2041 3 3.0 

434627 A1221894 Hs.39311 ESTs 3.0 

30 442611 BE077155 Hs.177537 hypothetical protein DKFZp761B1 514 3.0 

425477 AW958879 Hs.270535 ESTs 3.0 

457183 H91882 Hs.118569 Dvl-bindtng protein IDAX Cmhibitlon of 3.0 

433014 NM_014711 Hs,279912 K1AA0419 gene product 3.0 

415542 R13474 Hs.290263 ESTs. Weakly similar to 138022 hypotheti 3.0 

35 416173 R52782 gb:yg99d09.r1 Scares Infant brain 1 NIB H 3.0 

408155 AB014528 Hs.43133 KIAA0628 gene product 3.0 

453913 AW004683 Hs.78934 mutS (E. co^ homolog 2 (colon cancer. 3.0 

435495 AI754212 Hs.21951 Homo sapiens Xq pseudoautosomal regbn; 3.0 

423629 AW021173 Hs.18612 Homo sapiens cDNA:FLJ21 909 fis. clone H 3.0 

40 411836 AW901879 Hs.314453 ESTs 3.0 

415030 D31118 Hs.1 91 735. hypothetical protein MGC10520 3.0 

419606 AW294795 Hs.198529 ESTs 3.0 

440310 AA878939 Hs.125406 ESTs 3.0 

443608 AI375957 Hs.289074 F-box only protein 22 3.0 

* 45 414140 AA281279 Hs.23317 hypothetical protein FU14681 3.0 

444781 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote 3.0 

435393 AA701259 Hs.189299 ESTs 3.0 

454071 AI041793 Hs.42502 ESTs 3.0 

446922 BE175605 gb:RC5-HT0580-100500-022-H07HT0580Homo3.0 

50 448062 AW295923 Hs.255472 KIAA1 843 protein 3.0 

427711 M31659 Hs.18O408 solute carrier family 25 (mitochondrial 3.0 

450382 AA397658 Hs.60257 Homo sapiens cDNA FU 13598 fis. clone PL 3.0 

424866 W01938 Hs.337243 ESTs. Weakly similar to ALU7_HUMAN ALU S 2.9 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 2.9 

55 420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.9 

445625 BE246743 Hs.288529 hypothetical protein FU22635 2.9 

403677 NA C4001462:gi|48877151gb|AAA79329.2|(L088 2.9 

41 1 093 BEG67650 gb:MR4-BT0358-09030OO03-e01 BT0358 Homo 2.9 

435255 W87434 Hs.106015 ESTs. f^toderately similar to ALU1.HUMAN A 2.9 

60 443127 6E568102 Hs.180312 mitochondrial ribosomai protein S16 2.9 

448104 AI674818 Hs.316433 Homo sapiens cDNAFU11375 fis. done HE 2.9 

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2,9 

430414 AW365665 Hs.120388 ESTs 2.9 

423600 AI633559 Hs.310359 ESTs 2.9 

65 458562 N34128 Hs.145268 ESTs 2.9 

402109 NA Target Exon 2.9 

429629 BE501732 Hs.30622 Homo sapiens cONAFUl 3010 fis, done NT 2.9 
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442295 
419752 
404721 
445107 
401987 
430566 
444517 
445563 
427691 
455551 
401458 
421039 
459504 
424962 
409617 
416931 
413221 
409732 
433687 
434340 
454529 
421379 
452291 
457402 
449051 
408761 
401093 
435061 
447985 
449340 
426384 
411905 
405953 
420854 
434265 
428365 
442861 
448337 
452554 
412248 
449450 
434757 
409038 
454545 
439842 
428479 
424800 
411086 
400250 
449168 
456482 
426044 
431854 
405873 
440400 
458265 
413708 
423739 
424408 
453095 
421825 
417742 
402765 
444378 
419172 
401497 
402376 



AI208121 Hs.147313 



A1827248 Hs.224398 Homo sapiens cDNAFU11469ris. done HE 2.9 
AA249573 Hs.152618 ESTs, Modefately similar to ZN91_HUMANZ 2,9 
NM_005596*:Homo sapiens nuclear factor I 2.9 
ESTs. Weakly similar to 138022 hypothefi 2.9 
NM_002737^Homo sapiens protein kinase C 2.9 
Hs.190149 ESTs 2.9 
Hs.146883 ESTs 2.9 
Hs.149006 ESTs 2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs. Weakly similar to T32250 hypotheti 2.9 
Target Exon 2.9 
Hs.101299 cul!in5 2,9 
gb:601315974F1 NIH_MGC_8 Homo sapiens cD2.9 
Hs.153954 TRAM-nke protein 2,9 
Hs.55209 Homo sapiens mRNA; cONA DKF2p434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PM0-HT0425-141299-001-F08 HT0425 Homo2.9 
Hs.56148 NY-REN-58 antigen 2.9 
gb:ny57g01.s1 NCLCGAP Pri 8 Homo sapiens 2.9 
Hs.128685 ESTs. WeaklysimtlartoTI 7226 hypotheti Z9 
Hs.270425 ESTs 2.9 
Hs.1 03982 small inducible cytokine subfemily B (Cy 2.9 
Hs.28853 CDC7 (cell division cycle 7, S.cerevisl 2.9 
Hs.149342 acGvatton-induced cytidine deaminase 2.9 
HS.333S26 HER2 receptor tyrosine kinase (c-efb-b2. 2.9 
Hs.238936 ESTs. Weakly similar to (defBne not ava 2.9 
C12Q00586*:gi|6330167|dbl|BAA86477.1| (A 2.9 
Hs.163944 ESTs 2.9 
Hs.200949 ESTs 2.9 
Hs.195359 hypothetical protein MGC10954 2.9 
Hs.303662 ESTs 2.9 
gb:601 193893F1 NIH_MGC_7 Homo sapiens cD 2.9 



AA481282 
AI939339 
AW873606 
AW194426 
AI868634 

NM 003478 
BE514127 
NM 012288 
BE003760 
D45371 
BE161151 
NM^016122 
AA743991 
A(193043 
Z45439 
Y15221 
AF015592 
AW452648 
AW981400 
AA057264 

AI651474 
AI681475 
AW235786 
Ai472078 
BE265067 
NA 

AW296927 

AA846811 

AA295331 

AA243837 

AW206453 

AW452434 

BE1 76480 

AL039852 

AI038997 

T97490 

AW806899 

AI910896 

Y00272 

AL035588 

BE070800 

NA 

NM_016206 

AA485224 

AA502490 

AA383550 
NA 

AA994364 

AI075375 

BE158791 

AA398155 

A1754813 

AW294631 

AA298758 

R64719 

R41339 
AW338625 



Target Exon 2.8 
gb:UI-H-BW0-ajc-<>07-0-Ul.sl NCLCGAP.Su 2.8 
Hs.130554 Homo sapiens cDNA: FU23089 fis, done L 2.8 
Hs.183851 Homo sapiens cDNA FU20042 fis, done CO 2.8 
Hs.57787 ESTs 2.S 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs. Weakly simitar to ALU5_HUMAN ALU S 2.8 
gb:RC3-HT0585-160300-022-c02 HT0585 Homo 2,8 
Hs.491 36 ESTs. Moderately similar to ALU7.HUMAN A Z8 
Hs.132921 ESTs 2.8 
Hs.50002 small inducible cytokine subfamily A (Cy 2.8 
gb:QV4-ST0023-160400-172-c12 ST0023 Homo 2.8 
Hs.132413 ESTs 2.8 
Hs.184572 ceH division cycle 2. G1 to S and G2 to 2.8 
Hs.153203 MyoD family inhibitor 2.8 
gb«C3-BT0502-251199-011-c07 BT0502 Homo 2.8 
Eos Control 2.6 
Hs.23142 colon cardnoma related protein 2.8 
gb:aa41b12.s1 NCLCGAP.GCB1 Homo sapiens2.8 
Hs.336695 ESTs 2.8 
Hs.271699 polymerase (DNA directed) iota 2.8 
Target Exon 2.8 
Hs.125594 ESTs, Weakly similar to T25472 hypotheti 2.8 
Hs.128193 ESTs, Weakly simUar to 1RX2_HUMAN IROQU 2.8 
gb:IL2-HTO397-O91299-O25-D02 HT0397 Homo 2.8 
Hs.97600 ESTs 2,8 
Hs.146428 coUagen. type V, alpha 1 2.8 
Hs.11325 ESTs Z8 
Hs,183747 ESTs, Moderately similar to CALB.HUMAN C 2.8 
gb:EST22d11 WATM1 Homo sapiens cONA don 2.8 
C100362r:gi|12407405|gbIAAG53491.1|AF22 2.8 
Hs,12569 ESTs 2.8 
Hs,22120 ESTs 2.8 
Target Exon 2.8 
C19000763*:gi|1363912|plrI|JC4296 ring f 2.8 
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405041 NA C3001706*:gi|1345652|spIP15989|CA36.CHIC 2.8 



408758 NM„003686 Hs.47504 exonudeasel 2.8 

431917 D16181 Hs.2868 peripheral myefin protein 2 2.8 

437583 AA761190 Hs.244627 ESTs 2.8 

5 453737 AA744862 Hs.194293 ESTs. Weakly simaar to 154374 gene NF2 2.8 

458094 AF086325 gbiHomo sapiens full length insert cDNA 2.8 

401283 NA Target Exon 2.8 

4 10784 AW803201 gb:IL2-UM0077-070500-086-E06 UM0077 Homo 2.8 

417601 NM 014735 Hs.82292 KiAA0215 gene product 2.8 

10 418236 AW994005 Hs.337534 ESTs 2.8 

435532 AW291488 Hs. 117305 Homo sapiens, ctone lhtAGE:3682908. mRNA 2.8 
454714 AW815098 gb:QV4^T0212J091199-O23-f1OST0212 Homo 2.8 

418629 BE2475S0 Hs.88859 growth factor receptor-bound protein 7 2.8 

442101 AI651930 Hs.135684 ESTs 2.8 

15 405080 AK000375 Hs.88820 HDCMC28P protein 2.8 

414661 T97401 Hs.21929 ESTs 2.8 

425589 AI6SQ633 Hs.143688 Homo sapiens cDNA: FU23031 fis. done L 2.8 

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 2.7 

428824 W23524 Hs.173059 ESTs 2.7 

20 421568 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7 
414596 BE386870 gb:601 27527 1F1 NIH_MGC_20 Homo sapiens c 2,7 

440868 R79707 Hs.263339 ESTs, Moderately similar to 138022 hypot . 2.7 

452943 BE247449 Hs.31082 hypotheUcal protein FU 10525 2.7 

443772 AV646449 Hs.282872 ESTs 27 

25 432361 A1378562 Hs.159585 ESTs 27 

430375 AW371048 Hs.93758 H4 histone family, member H 27 

406504 NA C50Q0558:gi|4504675|ref|NP_002175.1| Int 27 

423279 AW959861 Hs.290943 ESTs 27 

424871 NM_004525 Hs.1 53595 low density lipoprotein-related protein 27 

30 453619 H87648 Hs.33922 Homo sapiens, done MGC:9084. mRNA, comp 27 

423961 D13665 Hs.136348 osteoblast specific fador 2. (fasddin 27 

422156 N34524 gb:yy56d10.s1 Soares_multiple_sderosis_ 2.7 

415752 BE314524 Hs.78776 putative transmembrane protein 27 

419987 NM^005014 Hs.94070 osteomodulin 27 

35 406182 NA Target Exdn 27 

416495 X69970 Hs.7g350 RYK receptor-fike tyrosine kinase 2.7 

444701 AI916512 Hs.198394 ESTs 27 

408171 AA301228 Hs.43299 hypothetical protein FU12890 27 

430153 AW968128 Hs.336679 ESTs 2,7 

40 413383 AA128978 Hs.154706 hypoftefical protein FU1491 7 2.7 

414831 M31158 Hs.77439 protein kinase, cAMP-dependent regulate 27 

413278 BE563085 Hs.833 interferon-stimutated protein. 15 kDa 2.7 

433132 AB026264 Hs.284245 hypothetical protein IMPACT 2.7 

437030 AA742577 Hs.303781 EST 2.7 

45 439031 AF075079 gb:Homo sapiens full length insert cDNA 27 

449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 27 

406153 Target Exon 27 

406625 Y13647 Hs.119597 stearoyl-CoA desaturase (delta-9-desatur 2.7 

444698 AI188139 Hs.147050 ESTs 27 

50 432328 A1572739 Hs.195471 6-phosphofructo-2-kinase/fructose-2.6-bi 27 

429628 H09604 Hs.13268 ESTs 2.7 

420149 AA255920 Hs.88095 ESTs 27 

431207 AA495925 Hs.9394 ESTs 27 

438394 BE379623 Hs.27693 peptidylprolyl isomerase (cydophilinH 27 

5 5 443304 AI050073 Hs,135338 ESTs 27 

427660 AI741320 Hs.1 14121 Homo sapiens cDNA: FU23228 fis. done C 27 

408460 AA054726 Hs.285574 ESTs 27 

416515 N91716 Hs.194140 ESTs. Weakly similar to J38022 hypotheti 2.7 

429922 Z97630 Hs.226117 HI histone family, member 0 27 

60 418203 X54942 Hs.83758 CDC28 protein Wnase 2 27 

439509 AF086332 Hs.58314 ESTs 2.7 
402184 NA ENSP00000245238*:CDNAFLJ10922fis.don 2.7 

450496 AW449251 Hs.257131 ESTs 2.7 

451963 AI825440 Hs.224952 ESTs 2.7 

65 457938 AI373638 Hs.133900 ESTs 2.7 

441541 AA938663 Hs.199828 ESTs 2.7 

441111 AI806867 Hs.126594 ESTs 27 
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423020 
445354 
427961 
410889 
445234 
413903 
406069 
447410 
401256 
415139 
420218 
455511 
438825 
409564 
452837 
434876 
421585 
453279 
430785 
456986 
433066 
421952 
429208 
430733 
441720 
418986 
432481 
434338 
417061 
410530 
456672 
425071 
408868 
451531 
405822 
418301 
417315 
434699 
443204 
405638 
452542 
403943 
404535 
402800 
449144 
454934 
424717 
428303 
427970 
450638 
453034 
455097 
427317 
408875 
427510 
423201 
406271 
442696 
454018 
435420 
434398 
455708 
439347 
407523 
425101 
435153 
409139 



AA383092 

AV653485 

AW293165 

X91662 

AW137636 

AA496493 

NA 

AI470235 
NA 

AW975942 

AW958037 

BE144762 

BE327427 

AA045857 

AL121053 

AF160477 

AK001122 

AWa33940 

Z30201 

D38299 

Nri.006456 

AA300900 

AA447gg0 

AW975920 

A!346487 

AI123555 

AW451645 

AW754311 

Af675944 

M25809 

AK002016 

NM_013989 

AW292286 

AA018311 

AW976201 
AI080042 
AA643687 
AW205878 

AW812256 

Z25884 
NA 

A)989503 

AW846080 

H03754 

AW974476 

AA418187 

AK001826 

BE246010 

AW855802 

AB028955 

NM_015434 

Z47542 

NrA_000163 

BE566962 

AW016892 

AI928513 

AA1210g8 

BE069326 

W24320 

X64984 

AA830431 

AA668763 

AI681917 



Hs.1608 
Hs.6390 
Hs.143134 
Hs.66744 
Hs.146059 
Hs^136 



replication protein A3 (14kD) 2.7 
Homo sapiens done FLB3344 PRO0845 mRNA, 2.7 



ESTs 2,7 

twist (Orosophlla) homolog (acrocephalos 2.7 

ESTs 2.7 

ESTs 2.7 

Target Exon 2.7 

Hs.172698 EST 2.7 

NM_024089*:Homo sapiens hypothetical pro 27 

Hs.48524 ESTs 2.7 

Hs.286 nbosomai protein L4 2.7 
gb:CM0-HT0180^1099-065-b04 HT0180 Homo2.7 



Hs.79953 
Hs.54943 
Hs.5534 
Hs.61460 
HS.1Q5859 
Hs.59698 

Hs.170917 

Hs.288215 

Hs.98849 

Hs.190478 

Hs.283361 

Hs.28739 

Hs.81796 

Hs.151504 

Hs.188691 
Hs.64173 
Hs.1 14727 
Hs.154424 
Hs.255058 
Hs.1 14762 

Hs.53913 
Hs.336901 
Hs.149425 
Hs.29643 



ESTs 

fracture callus 1 (rat) homolog 

Homo sapiens cONA FU12961 fis, done NT 

Ig superfamlly receptor LNIR 

hypothetical protein FU10260 

ESTs 

gb:HHEA22G Atrium cDNA library Human hea 
prostaglandin E receptor 3 (subtype EP3) 



ESTs. Moderately simitar to AF161511 1 H 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ1 1973 fis, clone HE 



2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 
2.6 



Hs.121483 

Hs,233405 
Hs.314324 
Hs,152213 
Hs.183601 
Hs.330515 
Hs.25245 
Hs.271468 

Hs.175780 
Hs.48604 
Hs.179312 
Hs.125180 

Hs.7063 
Hs.100855 
Hs.59203 
Hs.3838 



gb:CM1-CT0337-141299-068-f07 CT0337 Homo 2.6 

Homo sapiens cONA FLJ1 2033 fis, done HE 2.6 

ATPase, H transporting, lysosomal (vacuo 2.6 

Homo sapiens, clone MGC:16327, mRNA, com 2.6 

deiodinase, iodothyronine. type II 2.6 

ESTs 2.6 

ESTs 2.6 

Target Exon 2.6 

hypothetical protein FU10252 2.6 

ribosomal protein S24 2.6 

Homo sapiens cDNA FLJ1 1980 fis, done HE 2.6 

Homo.saplens cDNA FU13103 fis, done NT 2.6 

Target Exon 2.6 
gb:RC0-ST0174-191099-031-a07 ST0174 Homo 2.6 

C5000355:gi|4503225|reflNP.000765.1| cyt 2.6 

chloride channel 1 . skeletal musde (Th 2.6 

Target Exon 2.6 

ESTs 2.6 

ESTs 2.6 

wingless-type MA/fTV integration site fami 2.6 

regulator of G-protein signalling 1 6 2.6 

ESTs 2.6 

hypothetical protein FU11269 2,6 

Homo sapiens mRNA for FU00038 prioteln. 2.6 



gb:RC1-CT0279-170200-023^f08 CT0279 Homo 2.6 

KIAA1 032 protein 2.6 

DKFZP434B168 protein .2.6 

small nudear RNA activating complex, po 2.6 

growth hormone receptor 2.6 

Target Exon 2.6 

Homo sapiens cDNA: FU20913 fis. done A 2.6 

ESTs 2.6 

ESTs 2.6 

serum-inducible kinase 2.6 



gb:QV3-BT0381-170100-060^03BT0381 Homo 2.6 

Hs.102941 Homo sapiens cONA: FU21531 fis. done C 2.6 

gb*i1.sapiens mRNA HTPCRX10 for olfadory 2.6 

Hs.180811 ESTs 2.6 

Hs.291939 ESTs ^6 

Hs.3321 ESTs. Highly similarto IRXI.HUMAN IROQU 2.6 
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455100 BE160198 gb:QV1 -HT041 3-01 0200-059-h03 HT0413 Homo 2.6 

414612 BE274552 HsJ6578 protein inhibitor of activated STATS 2.6 

440283 AI732892 Hs.190489 ESTs 2.6 

423025 AA831267 Hs, 12244 hypothetical protein FU20097 2.6 

5 431473 AA825686 Hs.321176 ESTs, Wealdy similar to S65824 reverse t 2.6 

404440 NM_021048:Homo sapiens melanoma antigen. 2.6 

403388 NA C3001398*:gi|12248917|dbjlBAB20375.l|(A 2.6 

403775 NA Target Exon 2.6 

405037 NA NM_.021628*:Homo sapiens arachidonate lip 2.6 

10 407447 AF290544 gb:Homo sapiens aminopeptidase mRNA. par 2.6 

420952 AA282067 Hs.88972 ESTs, Moderately similar to A46010 X-Bn 2.6 

435447 A1872932 gbwm72e03jc1 NCLCGAP_Ut2 Homo sapiens 2.6 

440202 AW516211 Hs.125300 ring linger protein 21, interferon-respo 2.6 

445854 AI702a85 Hs.145568 ESTs 2.6 

15 421247 BE391727 Hs.102910 general transcription factor IIH, polype 2.6 

414870 N72264 Hs.30O67O KIAA1204 protmn 2.6 

457411 AW085961 Hs.130093 ESTs 2.6 

424676 Y08565 .Hs.1 51678 UDP-N-acetyl-a!pha-l>gaiaclosamine:polyp 2.6 

404443 C8001428*:gil65722421emb|CAB62951.1| (Z9 2.6 

20 452268 NM-003512 Hs.28777 H2A hlstone family, member L 2.6 

430832 At073913 Hs.100586 ESTs, WeaWy similar to JE0350 Anterior 2.6 

444779 A1192105 Hs.147170 ESTs 2.6 

408633 AW963372 Hs.46677 PROLOG protein 2.6 

459089 F13036 Hs,27373 Homo sapiens mRNA; cONA DKFZp56401763 (f 2.6 

25 447197 R35075 gb:yh88b01 .si Soares placenta Nb2HP Homo 2.6 

454111 AW081681 Hs.269064 ESTs, Weakly similar to T42689 hypothetl 2,6 

411-165 NM_000169 Hs.69089 galactosidase, alpha 2.6 

406922 S70284 gb-'Stearoyl-CoA desaturase [human, adipo 2.6 

456045 H62943 Hs.154188 ESTs 2.6 

30 413111 BE065837 gb:RC2-BT0318-11O100-0l2-gl2 BT0318 Homo 2.6 

423123 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selen2.6 

432201 AI538613 Hs.298241 Transmembrane protease, serine 3 2.5 

427032 AF012023 Hs.1 73274 integrin cytoplasmic domain-associated p 2.5 

445417 AK001058 Hs.12680 Homo sapiens cDNA FLJ10196 fis. done HE 2.5 

35 422225 BE245652 Hs.l 18281 zinc finger protein 266 2.5 

428330 U22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uteiin 2.5 

410011 AB020641 Hs.57856 PFTAIRE protein kinase 1 2.5 

426310 NM_0D0909 Hs.169266 neuropeptide Y receptor Y1 2.5 

419589 AW973708 Hs.201925 Homo sapiens cDNA FU13446 fis. done PL 2.5 

40 437770 AA767881 Hs.122897 ESTs 2.5 

442818 AKd01741 Hs.8739 hypothetical protein FLJ10879 2.5 

414251 AL042306 Hs.97689 VASA protein 2.5 

428301 AW628666 Hs.98440 ESTs. Weakly similar to 138022 hypotheti 2.5 

431956 AK002032 Hs.272245 Homo sapiens CDNAFU11170 fis. done PL 2.5 

45 455732 BE080908 gb:QV1-BT0631-280200-a84-h07 BT0631 Homo 2.5 

458624 AI362790 Hs.278639 KIAA1684 protein; likely homotog of mous 2.5 

428257 BE394723 Hs.275243 SlOO caldum-bmding protein A6 (cateyd 2.5 

418054 NR<L002318 Hs,83354 lysyl oxidase-like 2 2.5 

458652 AW375610 Hs.117102 hypothetical protein FU13046 similar to 25 

50 458012 AI424899 Hs.188211 ESTs 2.5 

422996 BE091089 gb:PM4-BT0724-130400-006-cb7 BT0724 Homo 2.5 

410804 U64820 Hs.66521 Machado^oseph disease (spinocerebellar 2.5 

457211 AV\/972565 Hs.32399 ESTs, WeaWy similar to S51797 vasodilat 2.5 

440029 AW089705 Hs.293711 ESTs, Weakly similar to S64329 probable 2.5 

55 448141 AI471598 Hs.197531 ESTs 2.5 

409163 AA065081 gb:2m13a03.s1 Stratagene pancreas (93720 2.5 

431385 BE178536 Hs.l 1090 membrane-spanning 4-<lomair«, subfamily A 2.5 

414569 AF109298 Hs.l 18258 prostate cancer associated protein 1 2.5 

439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 2.5 

60 455935 BE1 58687 gb:CM0-HT0395-280100-169-b09 HT0395 Homo23 

425025 AW953168 Hs.12407 ESTs 2.5 

416589 AA652687 Hs.98151 Human DNA sequence from done RP5-1103G7 2.5 

404826 Target Exon- 2.5 
422938 MM 001809 Hs.1594 centromere protein A (17kD) 2.5 
65 421991 NWL014918 Hs.110488 KIAA0990 protein 2.5 
417404 Nl\1.007350 H5.82101 pleckstrin homology-like domain^ family 2.5 

448516 AW898595 gb:RC1-NN0073-2604004)11-g09 NN0073 Homo2.5 
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403356 
404983 
418282 
427409 
431806 
443367 
421246 
439217 
400925 
404552 
417168 
418841 
426853 
427738 
457384 
447128 
454693 
434657 
402077 
400289 
. 409723 
447020 
455068 
431232 
408938 
411571 
A26504 
428248 
408813 
423504 
425441 
443066 
443556 
428943 
425320 
430388 
423242 
416241 
440244 
409239 
452464 
410718 
408877 
445150 
407756 
407633 
449754 
419316 
429118 
440331 
449344 
459006 
423165 
411337 
438290 
406414 
424498 
443464 
424856 
440304 
409045 
422648 
428619 
412520 
430602 
408031 
403133 



NA 

AA215535 
AW467143 
AF186114 
AW071349 
AW582962 
AF086041 

NA 

AL133117 

NM_062332 

U32974 

NM_000318 

AA501760 

AI271898 

AW813428 

AA641876 

NA 

X07820 

AW885757 

T27308 

AI807B94 

AI024353 

AA059013 

AA122393 

AW162919 

An26772 

A(580090 

N80077 

AA449644 

AW297921 

AA256769 

AW086180 

U29344 

AA356923 

AL039402 

N52639 

A1743977 

AA740875 

AW500507 

AI920783 

AA479033 

AI446747 

AA1 16021 

NKA_.007069 

H00820 

AA236255 

H20669 

AL046412 

AI640355 

AW298631 

AI937547 

AW837349 

AA843719 

AB033043 

BE548446 

AA347746 

BE159984 

AA635062 

D86983 

AL135623 

AA442324 

D13752 

AA081395 



ENSP00000251525*:Hypothetical protein Kl 2.5 

ENSP00000252242*:Kefatrn. type It cytosk 2.5 

Hs.98133 ESTs 2.5 

Hs.135411 adin related protein 2.5 

Hs.270737 tumor necrosis factor (ligand) superfami 2.5 

Hs.215937 ESTs 2.5 

Hs.102897 CGM7 protein 2.5 

Hs.42975 ESTs 2.5 

Target Exon 2.5 

ENSP00000220888*:Z!NC FINGER TRANSCRIPTI2.5 

Hs.81376 Homo sapiens mRNA; cDNA DKFZp5fi6L1 121 (f 2.5 

Hs.89137 tow density iipoprotein-related protein 2.5 . 

Hs.172777 baculoviral lAP repeat-containing 4 2.5 

Hs.180612 penoxisoma! membrane protein 3 {35kD, Ze 2.5 

Hs.15806 Homo sapiens mRNA; cDNA DKFZp434H2019 (f 2.5 

Hs.164866 cydinK 2.5 

gb:MR3-ST01 92-010200-210-005 ST0192 Homo 2.5 

Hs.191840 ESTs 2.5 

TaigetExon 2.5 

Hs.2258 matrix metaltoproteinase 10 (MMP10;str 2.5 

Hs.257862 ESTs 2.5 

Hs.16986 hypothetical protein FU1 1046 2.5 
Hs.47274 Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2.5 

Hs.131755 hypothetical protein FU14298 2.5 

Hs.22607 ESTs 2.5 

Hs.70811 hypothetical protein FU20516 2.5 

Hs.170160 RAB2, member RAS oncogene famHy-fike 2.5 

Hs,40479 ESTs 2.5 

Hs.48295 RNA helicase famfly 2.5 

Hs.24792 chromosomei 12 open reading frame 5 2.5 

Hs.193063 Homo sapiens cDNA FLJ14201 fis. done NT 2,5 

Hs.255703 ESTs 2.5 

Hs.94949 methylmalonyl-CoA eptmerase 2.5 

Hs.37636 ESTs. Weakly similar to KIAAl 392 protein 2.5 

Hs.83190 fatty add synthase 2.5 

Hs.240770 nuclear cap binding protein subunit 2, 2 2.5 

Hs.1 25763 DEME-6 protein 2.5 

Hs.32683 ESTs 2.5 

Hs.205144 ESTs 2.5 

Hs.44307 ESTs. Moderately similar to 138022 hypot 2.5 

Hs.192619 KIAA1600 protein 2.5 

Hs.191435 ESTs 2.5 

Hs.130315 ESTs. Weakly similar to A47582B-cellgr 2.5 

Hs.338704 olfactory receptor, family 7, subfamily 2.5 

Hs.38260 ubtquitin spedfrc protease 18 2.5 

H5.37189 similar to rat HREV107 2.5 

Hs.30977 ESTs. Weakly similar to B34087hypotheti 2.5 

Hs.298419 ESTs 2.5 

Hs.35406 ESTs. Highly similar to unnamed protein 2.5 

Hs.202151 ESTs 2.5 

Hs.312691 ESTs 2.5 

Hs.27721 Wolf-Hirschhom syndrome candidate 1-lik 2.5 

Hs.124915 hypothetical protein 1^GC2601 2.5 

gb:QV2-LT0038-270300-108-d12L7T0038 Homo 2.5 

Hs.122341 ESTs 2.5 

C5000506*:gi|124941|sp|P186l4[rrA1^RAT I 2.5 

Hs.149377 hypothetical protein DKFZp761L0424 2,5 

Hs.5167 Homo sapiens mRNA; cONA DKFZp434F1 52 (fr 2.5 

Hs.9521 ESTs. Weakly similar to ZN43_HUMAN ZINC 2,5 

Hs.125395 ESTs 2.5 
Hs,50094 Homo sapiens mRNA; cDNA DKFZp434O0515 (f 2.5 

Hs. 118893 Melanoma assodated gene 2.5 

Hs.193914 KIAA0575 gene product 2.5 

Hs.795 H2A histone family, member O 2.5 

Hs.184927 cytochrome P450, subfamily XIB (steroid 2.5 

Hs.42173 Homo sapiens cDNA FLJ10366 fis. done NT 2.5 

Target Exon 2.5 
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413189 BE070231 gb:QV4-BT0407-260100-087-f12 BT0407 Homo 2.5 

400346 AB041269 Hs.272253 Homo sapiens mRNA for keratin 19, partia 2.5 

435509 AI458679 Hs.181915 ESTs 2.5 

458145 A1239457 Hs.130794 ESTs 2.5 
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Table 19A shows the accession numbers for those pkeys lacking unigenelD's for Table 19. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California), The Genbank accession numbers 
for sequences comprising each cluster are Usted in the "Accession" colmnn. 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pkey: Unique Eos probeset identifier nunnber 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 

407647 
407980 
408254 



CAT number Accessions 



409163 110418 J 



1007366_1 AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062655 
103087 J AA046309 A1263500 AA046397 

1049346J AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AWB07417 AW807083 AW845786 AW845801 
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424 
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390 
AW807395 AW845789 AW807101 AW807089 AW80751 9 AW807239 AW807509 AW807356 AW807526 AW807098 
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501 
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800 
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141 
AW807522 AW807487 AW8Q7514 AW807142 AW807232 AW807379 AW8071 14 AW807518 AW807199 AW80721 1 
AW807498 AW807D86 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784 
AW807037 AW807128 AW807080 AW8071 18 AW845807 AW807524 AW845803 AW807249 AW845795 AW807160 
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103 
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW80741 1 AW807088 AW845865 
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521 
AWa07488 AW807385 AW807355 AW807223 AWa07155 

AA065081 AA075017 AA084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457 
AA064704 AA082878 AA075742 AA0691 62 
409695 1 14876J AA296961 AA296889 AA076945 AA077528 AA077497 
410534 1207247.1 AW905138 AW753008 R13818Z43519 
410672 1214882_1 AW794600 AW794730 

410784 1221005_1 AW803201 BE079700 BE062940 

410785 1221055_1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 
410835 1223785.1 AW806906 AW808915 AW866460 AW866475 AW866462 AW866448 AW866372 AW866604 

411050 1230330.1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447 

411086 1231500.1 BE070800AW375226 BE149115 

411093 1231970.1 BE067650 AW817053 

411111 1232669.1 AW818127 AW81B161 R09719 

411171 1234393_1 AW820260 AW820332 R94406 

411337 1239217.1 AW837349 AW837355 AW882717 

411514 1248638_1 AW850178 AW850233 AW850445 AW85G446 

411670 1253680.1 AW856552 AW861101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562 
411905 1265181.1 BE265067 BE264978 AW875420 
412102 1277395.1 H 56435 H56572 AW892929 
412209 1283610.1 AW901456 AW901450 AW901441 
412248 1285000.1 BE176480 AW903298 AW903313 

413043 1346556.1 BE158766 BE061699 BE147360 BE147362 BE051666 BE051697 BE051647 BE061678 

4131 1 1 1349546.1 BE065837 BE065805 BE065799 BE065818 BE055839 BE065831 BE065894 BE065789 BE065792 

413189 1352723.1 BE070231 BE070229 BE070255 

413221 1353887 J BE161 151 BE162495 BE161002 BE072205 BE160989 BE162482 
413499 1373910.1 BE144884 H97942 

413708 1384140.1 BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158585 
414210 1426051.1 BE383592 BE281671 
414596 1465004.1 BE386870 Z41986 H08501 
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414605 


1465790_-1 


415747 


155189_1 


416173 


1574973J 


417742 


1696282^1 


417974 


171237 1 


418636 


177402J 


419536 


185688 1 


420854 


197072_1 


422156 


212379_1 


422996 


223666_1 


423833 


232451_1 


423841 


232507,1 


423945 


233556J 


425201 


247933.1 


426650 


270283J 


426878 


273265J 


430264 


315008,1 


430785 


323486_1 


431676 


33641 1J 


433687 


373061J 


434338 


383982,1 


434469 


387447_1 


435447 


405400_1 


437152 


43386 1 


437854 


44418J 


439031 


46798J 


439255 


470321J 


444910 


624951 1 


445432 


63943 1 


446922 


69865 1 


447197 


711623 1 


448420 


76273 1 


448516 


765241 1 


450522 


837264 1 


450736 


844652_1 


451024 


85565_1 


451067 


85759J 


451340 


86640.1 


452542 


921410.1 




92227 1 


453472 


968371J 


454307 


1106070.1 


454359 


1130674 1 


454545 


1223779,1 


454693 


1229132J 


454714 


1230493,1 


455047 


1250536,1 


455092 


1252971,1 


455097 


1253130,1 


455100 


1253334,1 


455431 


1289854,1 


455511 


1321229,1 


455609 


1337548.1 


455651 


1348732L1 


455685 


1350393,1 


455700 


1351264,1 


455708 


1352232,1 


455732 


1353874,1 


455838 


1374605,1 


455935 


1384144,1 


455945 


1385588.1 


456207 


165078.-1 


456482 


192289.1 


458094 . 


47311.1 


458673 


679507J 



BE390440 

AA381209 AA381245 AA167683 

R52782 R17313 H24192 R19876 

R64719Z44680R12451 

AA210765 T95700 H94407 

AW749855 AA225995 AW750208 AW750206 

AA603305 AA244095 AA244183 

AW296927 AI684514 AI263168 AA281079 

N34524 AA305071 AW954803 AA502335 AI433430 A1203597 AW026670 AW265323 AW850787 AA317554 AW9g3643 

AW835572 AW385512 AI334966 W32951 H6265e H53902 R88904 AW835732 

BE091089 BE09i123 AA319959 

AW503329 N46610 AA331571 

AW753967 AA370795 AA331630 AW962550 

AA410943 AW948g53 AA334202 AA332882 

AA3521 1 1 AW962247 AA429695 

AA382814 AA402411 AA412355 

BE069341 AW748403 AL044891 AI908240 AA393080 

AA470519 BE303010 BE302954 BE384120 

230201 AA486132 T72025 

AI685464 AW971335 AA513587 AA525142 

AA743991 AA604852 AW272737 

AW75431 1 AA630185 AW803285 

AA634806 C18732 AA729161 AA729860 

AI872932 AA682306 BE220163 W88695 T81307 H91447 * 

AL050027 BE089051 

AL119723 AL119874 A1909018 U50537 

AF075079 H48601 H48795 

BE164500 AA832198 BE164502 

A1201849 BE069007 AW946544 

AV653771 BE089370 

BE175605 Z43529 F06610 BE175602 AV661027 

R35075AI366546 R36167 

BE623004 AA380669 BE263627 BE246433 

AW898595 AW898588 AW89859d AW898663 AW898592 AI525093 

AI698839 A1909260 A1909259 

AW970060 Ai732366 AI792313 AW83g644 

AA442176AA259181 

BE172186 AA059279 AA020815 AA013437 
AW936273 AW340350 AA017208 
AW612256 AW812257 AI906423 AI906422 
AA026777 N50065 R09961 N54721 
AL037925 AL037931 AL037957 
AW855717 AW362452 AW362443 
N71277AW390764 

AW806899 AW866451 AW856393 AW866297 AW817869 
AW813428 AW813444 AW813367 AW813368 AW813429 AW813424 
AW815098 BE154843 BE154831 
AW852530 AW852527 AW852526 
BE152428 AW855572 AW855607 

AW855802 AW855794 AW855797 AW855806 AW855796 AW855808 AW855793 AW855807 
BE160198 AW935898 T1 1520 AW935930 AW856073 AW861034 
AW938484 BE001245 BE001190 
BE144762AW979091 

BE011668 BE011689 BE011627 BE011679 BE0t1699 BE011678 BE011696 BE01 1675 BE01 1622 BE011635 
BE054952 BEG64979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804 
BE064816 BE064850 BE064806 BE064796 BE064818 BE064975 BE064819 BE064810 BE064668 BE065059 
BE066976 BE066928 BE066927 

BE0681 15 BE06d104 BE068102 BE068096 BE0681O3 6E068154 BE06d198 

BE06g326 BE069290 BE069352 

BE080908 BE072258 BE072190 BE072236 

BE145808 BE145807 BE181883 

BE158687 BE158688 

BE160636 BE160608 BE180703 

AA1 93450 

AA485224 AA287308 AA258121 
AFa86325 W72956 W73221 AA219112 
N99626A1302701 
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TABLE 19B 



10 



15 



Table 19B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 19. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbanit identifier (Gl) numbers. 'Dunham I. et a\ ' refers to the pubfication 

entitied The ONA sequence of human chromosome 22.' Dunham I. et al., I^ture (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nudeotideposiSons of predicted exons. 
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401508 
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401575 


7229804 
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76253-76364 
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9789672 
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81747-82094 
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402765 


9357757 


Plus 
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402800 
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Plus 


43921-44049,46181-46273 
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82274-82443 
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8086844 
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194384-194645 
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Plus 
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8569930 


Plus 
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403388 


9438331 


Plus 


1 12733-113001.1 14599-1 14735 
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157156-158183 
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8101208 
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62554-62712.69449-69602 


403637 


8671936 


Minus 


142647-142771.145531-145762 


403639 


8671948 


Plus 


113234-113326.115186-115287.119649-119786 


403677 


7331517 


Minus 


55008-55083.62860-63051 
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Plus 
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404091 
404097 
404142 
404253 
404274 
404285 
404360 
404440 
404443 
404552 
404561 
404580 
404721 
404826 
404983 
405037 
405041 
405095 
405153 
405196 
405232 
405248 
405336 
405394 
405460 
405494 
405547 
405809 

405638 
405654 
405718 
405822 
405848 
405873 
405906 
405917 
405925 
405953 
406069 
406151 
408153 
406182 
406271 
406291 
406348 
406414 
406446 
406504 
406554 



7684554 

7770701 

9856692 

9367202 

9885189 

2326514 

9858450 

7528051 

7579073 

7243881 

9795980 

6539738 

9856648 

6572184 

4432779 

7543748 

7547195 

8072599 

9965565 

7230083 

7249042 

7259728 

6094635 

6624123 

7684569 

8050952 

1054740 

5757553 

6289229 
4895155 
9795467 
6273498 
7651809 
6758747 
7705124 
7712162 
6758795 
7960374 
9117732 
7144806 
9929734 
5923650 
7534217 
5686274 
9255985 
9256407 
9454509 
7711360 
7711566 



Minus 82121-83229 

Plus 55512-55781 

Minus 80316-80459 

Minus 55675-56055 

PJus 104127-104318 

Plus 32282-32416 

Minus 122873-122966.151324-151469,153093-153253 

Plus 80430-81581 

Minus 87198-87441 

Plus 19854-20010 

Minus 69039-70100 

Minus 240588-241589 

Minus 173763-174294 

Plus 47726-48046 

Minus 51178-51374.52000-52173 

Minus 127374-127578 

Plus 121230-U1714 

Plus 138877-139066 

Minus 175317-175500 

Minus 135716-135851 

Plus 125904-126063 

Plus 637-777 

Plus 33267-33563 

Minus 31900-32373 

Minus 52223-52389 

Minus 70284-70518 

Pius 124361-124520.124914-125050 

Minus 42814^3010.43583^3783.44863-45033,46429-46554.47815^8018.49961-50153,51624-51727,51823-51959.52702- 

52918.55469-55601.571 1 1-57307,58169-58296,60215-60332,61482-61727 

Plus 199260-199372.199826-199929 

Minus 53624-53759 

Plus 113080-113266 

Minus 1 54660-1 54974. 1 55203-1 55379 

Minus 28135-28244 

Minus 32129-32764 

Minus 10835-11059 

Minus 106829-107213 

Plus 129935-130282 

Minus 65101-65574 

Plus 68880-69374 

Minus 94087-94285 

Minus 12902-13069 

Minus 28256-28935 

Plus 36179-36692 

Plus 9562-9867 

Minus 71754-71944 

Plus 49593-49850 

Minus 1 16424-1 1 6527,1 18721-1 18859.121187-121364 
Minus 107068-107277 
Plus 106956-107121 
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TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL- 
5 SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
10 tissue level was set to the 85^^ percentile value amongst 144 non-malignant tissues, and the 
96*^ percentile value amongst the 73 breast cancer specimens was greater than or equal 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization ^(e.g. ig, fii3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 

15 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenefD: Unigene number 
20 Pred.ProlDomalns: Predicted Protein Domains 
Unigene Title: Unigene gene title 

R1 : Ratio of 93^ percentile of tumor to 85^ percentile of norma) body tissue 



25 Pkey ExAccn UnlgenelD Pred.ProtOomains 
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408591 
400291 
449746 
407277 
400292 
424735 
426878 
428848 
407178 
408000 
427585 
429441 
450375 
420931 
422109 
445730 
451110 
400297 
420813 
452744 
424634 
420757 
424399 
447350 
456207 
431448 
427217 
456938 
435496 
402578 
453160 
422505 
444342 



AF015224 Hs.46452 

AA401369 Hs.190721 

AI668594 Hs.176588 

AW170035 Hs^26736 

AA250737 Hs.72472 

U31875 Hs.272499 
BE069341 

NM_000230Hs.194236 

AA195651 Hs.104106 

L11690 Hs.620 

D31152 Hs.179729 

AJ224172 Hs.204096 

AA009647 Hs.8850 

AF044197 Hs.100431 



S73265 

A!624342 

AI955040 

AI127076 

X51501 

A1267652 



Hs.1473 

Hs.170042 

Hs.265398 

Hs.334473 

Hs.99949 

Hs.30504 



NM_003613Hs.151407 
X78592 Hs,99915 



Hs^533 
Hs.172634 



AI905687 
AI375572 
AA1 93450 

AL137517 Hs.334473 

AA3g9272 Hs.144341 

X52509 Hs.161640 

AW840171 Hs265398 

At263307 Hs.239884 
AL120862 Hs.124165 
NM.014398HS.10887 



SS.Uterogtobin^SS.Uteroglobin 
TM 

.SS,p450 
TM 

deattiZU5.TM,Activin.recp.pkinase. 
.SS JM 
TM 

SS.Leptin.SS.L^ptin, 
,SS.D0iydroorotase. 

Pleclin_repeatSH3,spectrin.SS,P!ectin_r 
SS.C1q.Collagen.SS,C1q,Co1lagen, 
.SS.Uteroglobin. 

,SS JM,disintegnn,Pep.M12B_propep,Repit} a disintegrin and metalloproteinase doma 



UnigeneTifle 

mammagiobin 1 
ESTs 

ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
HoRK) sapiens breast cancer antigen NY-BR 
BMP-RIB 

short-chain alcohol dehydrogenase family 
gb:QV3-BT0381-270100-073-c08BT0381 Homo 
leptin (murine obesity homolog) 
ESTs 

bullous pemphigoid antigen 1 (230/240kD) 
collagen, type X, alpha 1 (Schmid metaph 
lipophilin 6 (uteroglobin family member) 



SS,1L8,SS 
SS.Bombesin.SS 
,SS.TM,Cation_efflux 
SS 
TM 
SS SS 

,SS.TM,GNS1^SUR4.cNMP_binding.Rlla 
ig.tsp_1,SSMA 

honnone_recAndrogen_iBcep,zf-G4, 
SS 

.pkinase. 

.SSJM.p45ap450 

TM 

SS 



small inducible cytokine B subfamily (Cy 
gastrin-releastng peptide 
ESTs 

ESTs. Weakly simitar to transformation-r 
hypothetical protein DKFZp56401278 
prolactin-induced protein 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 
cartilage Intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb:zr40e07.f1 Soares_NhHMPu„S1 Homosapi 
hypothetical protein DKFZp56401278 
ESTs 



,SS,TM.aminolran_1_2,Cadherin_C_lenn,cad tyrosine aminotransferase 

SS ESTs, Weakly similar to transformation-r 

SS,p450.SS.TM.p450 C1001134:gi|2117372|pir||I65981 tatty ac 

SS H2B histone family, member L 

SS programmed cell de^th 9 (PDCD9) 

Lamp,SS,TM.Lamp. similar to lysosome-assodated membrane 
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Hs.155956 
Hs.137476 



449765 N92293 Hs.206832 
428227 AA321649 Hs.2248 
425692 D90041 
424001 W67883 
5 448595 AB014544 Hs.21572 
449448 D60730 Hs.57471 
418007 M13509 Hs.83169 
418994 AA296520 Hs.89546 
453596 AA44183a Hs.62905 
10 452401 NM_007115Hs.29352 
446591 H44186 Hs.15456 
419296 AA236115 Hs.120785 
452838 U65011 Hs.30743 
422805 AA436989 Hs.121017 
15 448390 AL035414 
447342 AI199268 
411869 W20027 
443709 AI082692 
459587 AA031956 
20 442580 AI733682 
400289 X07820 
411598 BE336654 Hs.70937 
415263 AA948033 Hs.130853 
433805 AA706910 Hs.1 12742 
25 407276 A1951118 Hs.326736 
443348 AW873596 Hs.182278 
421037 AI6B4808 Hs.197653 
424086 AI351010 
400295 W72838 
30 452461 N78223 
427365 AI873274 
433365 AF026944 Hs.293797 
409269 AA576953 Hs.22972 
432596 AJ224741 Hs,278461 
35 408771 AW732573 Hs.47584 
432912 BE007371 Hs.200313 
447033 A1357412 Hs.157601 
421155 H87879 Hs.102267 
424905 NWL002497HS.153704 
40 425398 AL049689 Hs.156369 
438167 R28383 Hs.24286 
459583 AI907673 
423945 AA410943 
439820 AL360204 Hs.283853 
45 402606 

445263 H57646 
430217 N47853 
447164 AF026941 Hs.17518 
431385 BE178536 Hs.1 1090 
50 423887 AL080207 Hs.134585 
415385 R17798 
425704 U79293 
429859 NM„007050Hs.225952 
- 425523 AB007948 Hs.158244 
5 5 428368 BE440042 Hs.83326 
418912 NM„000685Hs.89472 
422026 U80736 Hs.1 10826 
451952 AL120173 Hs.301663 
438199 AW016531 Hs.122147 
60 400608 

413472 BE242870 Hs.75379 
432374 W68815 Hs.301885 
402408 NA 

445537 AJ245671 Hs.12844 
65 451621 A1879148 Hs56770 
405654 NA 

434988 AM18055 Hs.161160 



Hs.21068 
Hs.19322 
Hs.23439 
Hs.134662 

Hs.1 30239 
Hs.2258 



Hs.102267 
Hs.2533 
Hs.1 08105 
Hs. 190721 



Hs.42586 
Hs.336901 



Hs.7535 
Hs.159264 



SS 

SS.IL8, 

.SS^cetyitransC 
.pkinase, 

LRRCT,LRR,SS,LRRCT,serine_cart)pept 
SS 



ESTs. Moderately similar to ALU8_HUMAN A 
small inducible cytokine subfamily B (CX . 
N-acetyltransferase 1 (aiytamine N-acety 
paternally expressed 10 
K1AA0644 gene product 
ESTs 



SS,hemopexin,Peptidase_M10,SS,Peptidase_matiixmetalloproteinase 1 (MMPI; inters 
SS,Iectin„c,sushi,EGF,SS.EGF.lectin_c.su selectin E (endothefial adhesion molecul 



SS 

.SS,CUB.XIink, 

PDZ.SS 

SS 

SS.SS 

htstone.SS.histone.hisfone 

SS 

.SS.lipocalin 
,SS,Peptidase_M1, 
,SSJM,SNF 
.SS.LIM, 
SS 



hypothec! protein FU14834 
tumor necrosis factor, alpha-induced pro 
PDZ domain containing 1 
ESTs 

preferentially expressed antigen in mela 
H2A histone family, member A 
hypothetical protein 

Homo sapiens. Similar to RIKEf^ cDNA 2010 

ESTs 

ESTs 

g b:zk1 5e04.s1 Soares j)regnant_uterus_NbH 
ESTs 



hemopexin,Peptidase_M10.SS,Peptidase^M10matrix metelloproleinase 10 (MMPIO; str 



histone.SS.histone.histone 
,SS,hislone,histone,linker_histone 
,SS,RibosomaLL7Ae. 
TM 

.SS.DENN 
SS 

.SS.Lysyl^oxidase 
SS 

.SS.G9a,PHD, 
TM 

SS TPR 

SS,TM,UPF0016.SS,TM,UPF0016 . 

SS,EGF,vwa.SS,TM,vwa, 

TM,K_tetra,ion_trans.SS,TM.K-.tetra,ionJ 

,SS,TM,>oiate_carrier 

SS 



H3 histone family, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphorylase kinase, deft 
programmed cell death 9 (PDCD9) 
iysyl oxidase 

aldehyde dehydrogenase 9 family, member 

transcription factor 

ESTs 

ESTs 

hypothetical protein FLJ13352 
matrifin 3 

potassium voltage-gated channel, delayed 

ESTs 

ESTs 



SS.Ly&yLoxldaseAldose_epim.Epimerase,S Iysyl oxklase 



pkinase,SS,TM.pkfnase,po)yprenyLsynt» 
SS 

,SS.TM,7tn\_1.p450,mn 
.pkinase, 

death,ZU5,TM,Activin_iBcp,pkinase. 

SS 
SS 

,SS.Acyltransfefase. 



NIMA (never in mitosis gene a^eiated k 
hypothetical protein similar to tenascin 
ESTs 

gb:iL-BT152-080399-004 BT152 Homo sapien 
gb:zl32h03.r1 Scares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
t^iVL024626:Komo sapiens hypothetical prot 
KIAA1560 protein 



,SS,RNA.poLA.RNA4)oLA2.RibosomaLS24e,ribosomal protein S24 



JM.IBR 
.SS.TM 

.SS.TM.BRCT.ankABC_lran,ABCJran 
.SS,ForK_head, . 

SS 

.SS,TM.Y _phosphatase.MAM,fn3, 
,SS,laminin_B,Iaminin_EGF,/aminin_Ntemfi 



Homo sapiens dgS mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein 

Human done 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS,Peptidase_M10,hemopexin,SS,Peptidase_matrix metalloproteinase 3 (stronrelysin 



SS.TM,7tnu1 .SS.TM.7lm_1 , 
SS 

,SS,pkinase, 
.SSArfGap, 
SS,TM.SS.TM 
SS 
SS 

,SS,carb_anhydrase 
.SS.TM.ras 
SS.npocaRnJipocalin, 
BTB.SS 
SS 



angiotensin receptor 1 
trinudeotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|pir||T25392 hypothe 
solute carrier family 1 (glial high afTi 
Homo saiNsns cDNA FU11346 fis, done PL 
NM.030920*:Homo sapiens hypothetical pro 
EGF-Gke-domain, multiple 6 (EQFL6) 
fatty acid binding protein 7. braoi 
C12001521:gi|7513934|pir||T31081 cca3 pr 
ESTs 
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17.0 

16.7 

16.5 

16.3 

16.2 

15.7 

15.5 

15.5 

15.0 

14.9 

14.8 

147 

14.3 

14.2 

14.2 

13.9 

137 

137 

13.5 

13.5 

13.3 

13.2 

13.1 

13.1 

13.0 

12.9 

12.8 

127 

12.5 

12.4 

12.3 

12.0 

11.9 

11.9 

11.9 

11.8 

11.8 

11.7 

11.6 

11.5 

11.5 

11.4 

11.4 

11.3 

11.2 

11.1 

11.1 

11.1 

10.9 

10.9 

10.7 

10.4 

10.3 

10.3 

10.3 

10.3 

10.3 

10.2 

10.1 

10.0 

9.9 

9.8 

97 

9.6 

9.6 

9.6 
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416220 


N49776 


Hs,170994 


.SS.TM 


hypothetical protein MGC10946 


9.5 


431808 


M30703 


Hs^70833 


SSJM,EGF,SS 


amphiregulin (schwannoma-derived growth 


9.5 


414142 


AW368397 


Hs.150042 


.SS.UDPGT 


Homo sapiens CDMA FU14438 fis. clone HE 


9.4 


400298 


AA032279 


Hs.61635 


TM 


six transmembrane epithets ant^en of 


9.4 


418601 


AA279490 


Hs.86368 


SS.TM.calretlculin.SSJM,calreticu!in. 


. catmegin 


. 9.4 


415539 


AI733881 


Hs.72472 


death^U5.TM^ct!Vin_recp,pkinase. 


BMP-RIB . 


9.4 


421451 


AA291377 


Hs.50831 


TM 


ESTs 


9.3 


429432 


AI678059 


Hs.202676 


SS 


synaptonemal complex protein 2 


9.3 


442441 


AI820662 


Hs.129598 


ss 


ESTs 


9.1 


426429 


X73114 


Hs.169849 


,SS,TM,fh3,ig, 


myosin-binding protein C, slow-type 


9.1 


406687 


M31126 


Hs.272620 


SS,Peptidase_M10,hemppexin,SS,Peptidase_matrixmetalloproleinase 11 (MMPII; stro 


9.1 


448693 


AW004854 


Hs.228320 


SS 


hypothetical protein FU23537 


9.1 


419946 


AB041035 


Hs.93847 


Ferric reductTM.Fenic reduct 


NM 016931:Homo sapiens NAOPH oxidase 4 ( 


9.1 


426214 


H59d46 


Hs.128355 


SS 


ESTs, Moderately stmHar to ALU7_HUMAN A 


9.0 


427718 


AI798680 


Hs.25933 


.SS.TM.histone.Secl ,histone,sugactr 


ESTs 


8.9 


414812 


X72755 


Hs.77367 


SSJL8,SS,1L8 


monokine induced by gamma Interferon 


8.8 


400285 


NA 




,TM,ABC tran,ABC.membrane, 


Eos Control 


6.8 


422330 


D30783 


Hs,115263 


SSJM,EGF,SS.TM 


epiregulln 


8.8 


416182 


NMJ004354HS.79069 


cycIin,SS 


cydin G2 


8.8 


420077 


AW512260 Hs.87767 


SS 


ESTs 


8.7 


452281 


T93500 


Hs^8792 


,SS,TGF-betaJGFbj3ropeptid6. 


Homo sapiens cDNA FU11041 fis. done PL 


8.7 


434531 


AA642007 


Hs.1 16369 


SS 


ESTs 


8.6 


408380 


AF123050 


Hs.44532 


,SS.TM,ubiquilin,7tm_3,ANF receptor.sush 


diubiquitin 


8.6 


443788 


AI732643 


Hs.144151 


TM 


ESTs 


8.6 


411078 


A1222020 


Hs.182364 


SS.SS 


CocoaCrisp 


8.5 


445495 


BE622641 


Hs.38489 


SS.SS.ENTH.I_LWEQ.ENTH.LLWEQ.DNA_mis_reESTs. Weakly similar to 138022 hypotheti 


8.5 


433426 


H69125 


Hs.133525 


,SS,TM 


ESTs 


8.5 


424871 


NM_004525Hs.153595 


SS.EGF.IdLrecepLa.IdLrecepLb.SS.TM.E 


low density lipoprotein-related protein 


8.4 


426215 


AW963419 Hs.155223 


SS 


stanniocalcin 2 


8.4 


409045 


AA635062 


Hs.50094 


TM 


Homo sapiens mRNA; cDNA DKFZp434O0515 (f 


8.4 


435525 


AI831297 


Hs.123310 


TM 


ESTs 


8.3 


409203 


AA780473 


Hs.687 


SS,p450.SS,p450 


cytochrome P450, subfamily IVB, polypepl 


8.3 


424902 


NM_003866Hs.1 53687 


SS.SS 


inositol polyphosphates-phosphatase, ty 


8.3 


431725 


X65724 


Hs.2839 


SS,Cys„knol»SS 


Nome disease (pseudoglioma) 


8.3 


418092 


R45154 


Hs. 106604 


,death,Zll5,pkinase.Activin_fecp, 


ESTs 


8.3 


439840 


AW449211 


Hs.105445 


SS 


GDNF family receptor alpha 1 


8.2 


427811 


M81057 


Hs.180884 


SS^n_carbOpeptPropep_M14,SS,Propep_M14carboxypeptjdase B1 (tissue) 


8.2 


420807 


AA280627 


Hs.57846 


SS.cpnIO 


ESTs 


8.2 


426320 


W47595 


Hs.169300 


SS.TGF-beta.T6Fb_propeptide,SS 


transfomning growth factor, beta 2 


8.2 


447078 


AW885727 Hs.301570 


.SS.kazat. 


ESTs 


8.1 


415786 


AW419196 Hs.257924 


SS 


hypothetical protein FU13762 


8-1 


410102 


AW248508 Hs^79727 


SS 


Homo sapiens cDNA FLJ14035 fis. done HE 


8.0 


404347 






SS 


Taiget Exon 


8.0 


433687 


AA743991 




TM 


gb:ny57g01.s1 NCI_CGAP_Pr18 Homo sapiens 


8.0 


421373 


AA80822g 


Hs.167771 


,SS.IMPDH_C,IMPDH_N.CBS 


ESTs. 


8.0 


422634 


NM^016010Hs,1 18821 


SS 


CGI-62 protein 


7.9 


453310 


X70697 


Hs.553 


TM.SNF,SSJM,SNF, 


solute carrier family 6 (neurotransmitte 


7.9 


435957 


N39015 


Hs.190368 


.SS.TM 


ESTs 


7.8 


407771 


AL138272 


Hs.62713 


,TM,cpn60_TCP1 .Serna. 


ESTs 


7.8 


443646 


AI085198 


HS.1W226 


.TSPN.vwctspJ ,EGF,miored, 


ESTs 


7.8 


446142 


A1754693 


Hs.145968 


,TM,cadherin.Cadherin_CJerm, 


ESTs 


7.7 


444649 


AW207523 


Hs.197628 


,SS,rrm, 


ESTs 


7.6 


435147 


AL133731 


Hs.4774 


,TM.SDF,UPAR.LY6. 


Homo sapiens mRNA; cDNA DKFZp761C1712 (f 


7.6 


439138 


AI742605 


Hs.193696 


TM 


ESTs 


7.6 


429220 


AW207206 


Hs.136319 


SS 


ESTs 


7.6 


428804 


AK000713 


Hs.193736 


.SS.UDPGT 


hypothetical protein FU20706 


7.5 


453511 


AL031224 


Hs,33102 


SS.SS 


transcription factor AP-2 beta (adivati 


7^ 


439809 


R41396 


Hs.101774 


SS 


hypothetical protein FU23045 


7^ 


414869 


AA157291 


Hs.21479 


SS 


ubinudein 1 


7,5 


416276 


U41060 


HS79136 


SS.TM.TM 


LIV-1 protein, estrogen regulated 


7.5 


452862 


AW378065 


Hs.8687 


,SS.Pep_M1 2B _j)Popep.Repro(ysin,tsp_1 , 


ESTs 


7.4 


452926 


AI742170 


Hs.31297 


.SS.TM 


duodena) cytochrome b 


7.4 


453331 


A(240665 


Hs.8895 


.SS,TM.disintearin,Pep_M1 2B j)rop8p,ReprD ESTs 


7.3 


420802 


U22376 


Hs.1334 


SS,NA,myb_DNA-binding 


v-myb avan myeloblastosis viral oncogen 


73 


450803 


R43646 


Hs.12422 


SS 


ESTs 


7.2 


422867 


L32137 


Ks.1584 


SS,EGF.tsp.3.SS»E2F_TDP. 


carfjlage ofigomeric matrix protein (COM 


7.2 


418004 


U37519 


Hs.87539 


SS.aIdedh.SS^ldedh. 


aldehyde dehydrogenase 3 feimOy. meniber 


7.2 
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426451 AI908165 

450701 H39960 

419519 AI198719 

410555 U92649 

433138 AB029496 

411558 AA102670 

409079 W87707 

417275 X63578 

432731 R31178 

442818 AK001741 

407366 AF026942 

427427 AF077345 

410785 AW803341 
401045 



Hs.169946 

Hs.288467 

Hs. 176376 

Hs.64311 

Hs.59729 

Hs.70725 

Hs.82065 

Hs.295449 

Hs.287820 

Hs.8739 

Hs.177936 



SS.GATA. 

.SS.LRR 

SS 

.TM.disintegrin.Reprolysin, 

SS.ig,Sema.SS.Sema.efhand 

SS,TM,SS.TM 

JM.fn3. 

SS.efhand.SS.efhand.ras 

,SS,fn3,fni,fn2.fn2,fn1 

WD40.SS 

.TM,IBR 

SS,leclin_cSS 

SS 



C11001883*:gi|6753278|ref|NP„033938,11 c 



ICEj>20.SS.lCEJ>10.ICE_p20.ICEj)10.ICEj) 



GATA-blndlng protein 3 (T-cell receptor 
Homo sapiens cDNA FU12280 fis. done MA 
ESTs 

a disintegrin and metalloproteinase doma 
semaphorin sem2 

gamma^minobutyric acid (GA6A) A recepto 
fnterteukln 6 signal transducer (gp130. 
parvalbumin 

fibronectin 1 

hypothetical proteiri FLJ10879 

gbiHomo sapiens cig33 mRNA, partial sequ 

ESTs 

9b:IL2-UM0079-0g0300-050-O03 UM0079 Homo 



418986 A1123555 Hs.81795 
442082 R41?^23 Hs.7413 
442861 AA243837 Hs.57787 
418836 A!655499 Hs.161712 
422060 R20893 Hs.325823 
444381 BE387335 Hs.283713 
404091 NA 

417866 AW067903 Hs.82772 
428819 AL135623 Hs.193914 
410275 U85658 Hs.61796 
425236 AW067800 Hs.155223 
415669 NM„005025Hs.78589 
416319 A1815501 Hs.79197 
412140 AA219691 Hs.73625 
442942 AW167087 Hs.131562 
446163 AA026880 Hs,25252 
443162 T49951 Hs.9029 
409602 W26713 Hs.256972 
428479 Y00272 
400300 X03363 
433404 T32982 
410079 U94362 
401781 

447359 NWL012093HS.18268 
402230 NA. 

427674 NMJ03528HS.2178 
428398 A1249368 Hs.98558 
458098 BE550224 
419968 X04430 
416636 N32536 
419703 AI793257 
424687 J05070 
449679 AI823951 
421296 NM_002666Hs. 103253 
442117 AW664964 Hs.128899 
400303 AA242758 Hs79136 
419440 AB020689 Hs.90419 
444858 AM 99738 Hs.208275 
432239 X81334 Hs:2936 
440705 AA904244 Hs.153205 
400285 NA. 

445466 H38026 Hs.308 
423201 NM-000163HS.125180 
433043 W57554 Hs.125019 
439509 AF086332 Hs.58314 
425247 NM.005940HS.155324 
409757 NM_001898Hs.l23114 
425292 NM_005824Hs.l55545 
448045 AJ297436 Hs.20165 
452681 AF153330 Hs.30246 
452243 AL355715 Hs^8555 
439310 AF086120 H$.102793 



Hs.184572 

Hs.102720 
HS.S8589 



HS74170 

Hs.93913 

Hs.42645 

Hs. 1281 51 

Hs.151738 

Hs.129700 



,SS,Reproiysin,tsp_1, 
,TM.EPHJbd,pkinase.SAM,fn3. 
SS 

,TM.Aclivin_rBcp,pkinase,deathZU5, 

,SS,TM.CD35 

,SS,mito_can- 

,TM,7tm_3,ANF_receplor, 

SS,CoIlagen.COLFI,TSPN,SSJSPN 

SS.SS 

,SS,RibosomaLS4e 
SS 

,SS,serpin, 

SS.TM,lg.SS.TM 

.SS.kinesin, 

,SS.ig,Sema,pktnase, 

.SS,TM,fn3, 

filamentSS.filament.filament 
,SS.TM,DAGKa,DAGKc, 
.SS,pkinase,pkinase 



6.7 
ESTs 

ESTs: calsyntenin-2 

ESTs 

ESTs 

ESTs, Moderately similar to ALU5_HUrMN A 
ESTs, Weakly similar to S64054 hypotheti 
Target Exon 

collagen, type XI, alpha 1 
KIAA0575 gene produa 
transcripfaon factor AP-2 gamma (activat 
. stanniocaldn 2 
serine (or cysteine) proteinase inhibtto 
CD83 antigen (activated B lymphocytes, i 
RAB6 interacting, kinesin-like (rabkines 
ESTs 

prolactin receptor 
DKFZP434G032 protein 
ESTs 

eel! division cyde 2, Gl to S and G2 to 



ESTs 



,SS,TM,pkinase,Recep.L_domain.SH2,PH,FurHER2 receptor tyrosine kinase {c-erb-b2, 
SS 

G!yco_transL8,SS 

,SS,riteinient,Pribosyltran,fllamentArmad 
SS.adenylatekinase, 
,SS.TM.p450, 
histone^SS.hisione. 
.SS.TM 
SS 

SS,IL6,IL6, 
.SS.TM 
,SS.2f-C2H2, 



Target Exon 
adenylate kinase 5 . 
Target Exon 

H2B histone family, member Q 
ESTs 

metallothbnein IE (fundional) 
interieukin 6 (interferon, beta 2) 
solute earner family 16 (monocarboxyfic 
ESTs 



SS,Peplidase_M10.fn2.hemopexin.SS»TM,Pepmatrix metalloproteinase 9 (gelafirjase B 
SS tollold'like 1 

SS perilipin 
,SS.TM ESTs 
.SS.TM LIV-1 protein, estrogen regulated 

SS K1AA0882 protein 

SS ESTs, Weakly similar to ALUA^HUMAN )!)! 

SS,Peptidase_M10,hemopexin,SS,Peptidase_matrix metalloproteinase 13 (collagenase 



TM 

SS,TM.ABC_tran,ABC.membrane.SS 

anBstin.SS 

SS,TM,fn3,SS 

SS 

,SSJM.Syntaxin 

SS,Peptidase_M10.henrK>pexin.SS 

,SS.cystatin» 

SS 

.SS,TM 
,SS.TM 
SS 

,SS.TI^,UDPGT.cas^.kappa 



ESTs 

CI 6000922:giI7499103lpir||T20903 hypothe 

arrestin 3. retinal (X-arrestin) 

growth honmone receptor 

lymphokJ nudear protein (LAF-4) mRNA 

ESTs 

matrix metaltoproteinase 11 (MMP11;stro 
cystafin SN 

37 kDa leudne-iich repeat (LRR) protein 



sohite carrier family 19 (thiamine trans 

prograntmed cell death 9 

ESTs 



7.1 
7.1 

7:1 

7.1 
7.0 
7.0 
7.0 
7.0 
6.9 
6.9 
6.8 
6.8 
6.7 



6.7 

6.7 

6.6 

6.6 

6.6 

6.6 

6.6 

6.6 

6.5 

6.4 

6.2 

6.2 

6.2 

6.2 

6.2 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.1 

6.0 

6.0 

6.0 

6.0 

5.8 

5.8 

5.8 

5.8 

5.7 

5.7 

5.7 

5.7 

5.6 

5.6 

5.6 

5.5 

5-5 

5.5 

5.4 

5.4 

5.4 

5,4 

5.4 

5.3 

5.3 

5.2 
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441111 

452355 
427711 
418636 
429353 
441690 
430447 
429698 
425325 

423600 
414737 
403593 
407768 
445234 
411165 
.420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 



A1806867 

N54926 

M31659 

AW749855 

AL1 17406 

R81733 

W17054 . 

A1685085 

X52730 



Hs.1 26594 

Hs.29202 

Hs.180408 

Hs.200102 

Hs.33106 

Hs.332848 

Hs^6339 

Hs.1892 



,SSJM.Phosphocliest 
TM.7tm.1.TM 

■ ss 

,SS.TM,HECT 

.SS.TMABC_tfan,ABC_membrane, 
.SS.HECT^-UBR1.PABP,14^3, 

SS 

,SS.ras, 



phenylethanolamine N-tnethyltransferase 



SS.NNMT_PNMT_TEMr,SS,NNMT_PNMT_TEMT.STAR 



ESTs 

G protein-coupfed receptor 34 
solute carrier family 25 (mitochondrial 
gb:QV4-BT0534-281299-053-c05 BT0534 Homo 
ATP-binding cassette transporter MRP8 
ESTs 

SWI/SNF related, matrix associated, acti 
ESTs, Weakly simOar to S21348 probable 



Hs.310359 
Hs.125087 



AI633559 
AI160386 
NA 

D50915 Hs.38365 
AW137636 Hs.146059 
NM.000169HS.69089 
NM.014581HS.274480 
W88559 Hs.1787 
AA206186 Hs.79889 

U80034 Hs.68583 
AW085961 Hs.130093 
AI247716 Hs.232168 
NM_004460Hs.418 
AA641836 Hs.30085 



AI215069 
AF012023 
M73700 
NA 

AW057736 Hs.323910 
AF070526 Hs.13429 



Hs.89113 
Hs.1 73274 
Hs.105938 



M93221 
AA526235 
BE093589 
AA447453 



Hs.75182 
Hs.193162 
Hs.38178 
Hs,27860 



AW016669 Hs,29190 
AI668505 Hs.60380 



445413 
424420 
432378 
452190 
434674 
419986 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418887 
443514 
447499 



AA687376 

AA339449 

AI860021 

AA151342 

BE614743 

AI493046 

H26735 

AA831879 

AI345455 

AI910275 

BE391804 

W03242 

M97711 

AI638627 

AA1 79949 



Hs.269533 

Hs.82285 

Hs.270651 

Hs,12677 

Hs.146688 

Hs.146133 

Hs.91668 

Hs.136985 

HS.7691S 

Hs.1406 

Hs.62661 

Hs.44898 

Hs.105685 
Hs.175563 



AA863360 Hs.26040 
AA976718 Hs.202242 
AA312082 Hs,105445 
AW294092 Hs.21594 
BE465639 Hs.61779 
AK000136 Hs.10760 
H87648 Hs,33922 
NM.000246Hs.3076 
AI472078 Hs.303662 
AW935490 Hs.14658 
D45371 Hs.80485 
AW296927 
D31771 Hs.89404 
BE464288 Hs.141937 
AW262580 Hs.147674 



SS 
SS 

.CfOE-N.pkinase 
SS,SS 
.SS,TM 



5.1 

ESTs 

ESTs 

Target Exon 
KIAA0125 gene product 
ESTs 



SS.Melibkse.BTKPH.pkinase.SH2,SH3,Ribo gafactosrdase, alpha 
SS,lipocalin,SSJipocalin odorant43lnding protein 2A 

.TM,ion_trans.K.tetfa. proteolipid protein 1 (Pelizaeus-Merzbac 

SS,TM.TM monocyte to macrophage differentiatiorv-a 

TM,LRRCT.TM.LRRCT. C12000586*:gil6330167|dbj|BAA86477.1| (A 

Peptidase_M3, mitochondrial intermediate peptidase 

SS ESTs 
,SS.adh_,zinc, ESTs 

SS.DPPIV_N_temf»,Peptidase_S9,SS,DPPIV_N_fibroblast activation protein, alpha 



.SS.trypsin 
SS 

,SS,14-3.3 

SS,transferrin,7tm J .transferrin, 
SSJM,FoIate_canier,SS,TM.Folate_carrie 



hypothetical protein FU23186 
ESTs 

integrin cytoplasmic domain-associated p 
lactotransfenin 

NM_025243*:Homo sapiens solute carrier f 



,SS,TM,pkinase,Recep_L_domain,SH2,PH,Furt4ER2 receptor tyrosine kinase (c-€rl)-b2, 



.SS.Ca_channel_B. 
SS,TM,!ectin_c.fn2,Ricin_BJectin.SS.TM 
SS 
SS 

.SS.TM.7tm_1. 
.SS,TM.CBS,voltage_CLC 
JM.G]yco_hydro_1 
.SS,pkiriase,RhoGEF,ig,PH,SH3, 



Homo sapiens clone 24787 mRNA sequence 
mannose receptor. C type 1 
Homo sapiens cDNA FU1 1983 fis, clone HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 
ESTs 

ESTs, Moderately similar to ALU6^HUMAN A 
ESTs 



AIRS,f6nnyLtransf,GARS.SS,GARSAIRS.for phosphoribosylglyctnamide fomiyltransfer 



.pkinase 
SS.UPFC099.SS.UPF0099. 
,SS.TM,MAPEG. 
.SS.TM.UDPGT 

.SS.™.PH,SH2,Furin-like.pkinase.Recep_L 

»SS.HisLdeacetyl. 

pkinase,OPR, 

SS,trefoy.SSJM.IdUecepLa.SRCR,lryps 
SS,TM,GBP.TM.GBP 

SS 

SS,SS,ig. 

,SS.DEAD.ForK„head 
SS 

,SS.TM.p450. 

.ig.Sema, 

SS 

.SS,ras.Y_phosphatase,ras 

.SS,HMG_box,filamenl. 

SS.LRR,SS 

SS 

SS,LRR, 

.SS.ArfGap, 

.SS.BIR 

SS,C1q.CoIIagen,SS,C1q, 
.SSJM.Peptidase.I\/l1, 
SS.homeobox,homeobox. 
,SS,TM.MIP, 

.SS.TM,cadher1n.cadherin 

308 



ESTs, Moderately similar to A47582 B-cel 
CGI-147 protein 
prostaglandin E synthase 
ESTs 

Homo sapiens done PP1498 unknown mRNA 
ESTs 

GA-bmding protein transcription factor, 

trefoil factor 1 (pS2) 

guanytate binding protein 1, interferon- 

Homo sapiens clone TCCGTA00151 mRNA sequ 

gb:Human T-cell receptor (V beta 18.1, J 

KIAA1688 protein 

Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
ESTs, Weakly similar to fatty add omega 
ESTs 

GDNF family receptor alpha 1 
hypolhetica! protein MGC15754 
Homo sapiens cDNA FU13591 fis. done PL 
asporin (LRR dass 1) 

Homo sapiens, done M6C:9084, mRNA, comp 

MHC dass II transacthrator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gb:Ui-H-BW0^j&c^7-0-Ul.s1 NGLCGAP.Su 
msh (Drosophila) homeo box homok)g 2 
ESTs 

protocadherin beta 16 



5.2 
52 
5.2 
5^ 
5.1 
5.1 
5.1 
5.1 



5.1 
5.1 
5,1 
5.0 
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.pkinase. 


Homo sapiens done 23736 mRNA sequence 




^nOnftA AAOROfKA Ue 1^1 M'l 
lUsUO't rv\\JO£t20H nS.l*ilO0O 


.SS.CUB, 


ESTs 


A 1 
4.0 


H^jxxof v\£SQhz. ns.1004/1 


,SS.TM.FMO-nke 


ESTs 


A 1 
4.3 




»SS.7M 


ESTs. Weakly similar to 154374 gene NF2 


A ^ 
4.0 


lO^ODO AlsOnOIr MS. 1 £4309 


TM 


ESTs 


A ^ 
4.0 




.SS,filamentfi]ament 


Homo saptens keratin 17 (KRT17) 


A 1 
4.0 




pkinase.pkinasejC; 


serum/glucocoflicoid regulated kinase-P 


A 0 


4O/U00 Aid/ 101*1 nS.iiSoUZ/ 


.SS.TM 


ESTs 


A 0 
4.Z 


AWU/OOlU nS.iWOoo 


.pkinase. 


Homo sapiens cONA FU14142 fis. done MA 


4u£ 


44jiy4 AiyMytxj ns.27yuuy 


.SS.TM 


matrix Gla protein 


A O 

4.2 


4olo71 Aio2100o Hs. 118599 


.SS.GDNF. 


ESTs 


A *) 

4.2 


4o7^n Awy/^oDo ns.ozjyy 


WHI.WHI 


ESTs. Weakly similar to S51797 vasodllat 


4. J 


*tziDDb iNivi^uuujyyns.ioyo 


zf-C2H2.SS 


eariy growth response 2 (Kfox-20 (Drosop 


A 0 
4./ 


43id57 AIo45^7 HS.i0544o 


.SS.TM.pkinase 


ESTs, Weakly similar to B34087 hypotheti 


A 4 

4.1 


Ai^^syy AAozyzoD hs.33^U9j 


,SS,SAA_protelns,ABC_membrane^C_tran,serum amyloid A1 


A 4 

4»i 


444779 Al1y2i05 nS.14717U 


SS 


ESTs 


A 4 

4.1 


442295 AI827240 Hs.22439o 


,COLFI,vwc.Collagen. 


Homo sapiens cDNA FU11469 fis, done HE 


A 4 

4.1 


436396 AIdo34B7 HS. 15221 3 


.wnt. 


wingiess-type MMTV integration site fami 


A 4 

4.1 


44bU3y AnoU4yi ns.yu/9b 


.TM.GIyco_hydro_1 


ESTs 


A 1 
4. J 


422936 timjju 1 ot^nS. 1 594 


,SSJM,thiolase, 


centromere protein A (1 7kD) 


A i 
4.1 


406922 S7U2o4 


SS,TM,Desaturase,SS 


gb:stearoyl-CQA desaturase [human, adipo 


A 1 
4.1 


439265 AL133916 MS. 172572 


.SS,ig,pkinase.LRRNT.LRRCT, 


hypothetical protein FU20093 


A 1 
4.1 


424oUU AL035580 rlS.1532U3 


HLH.SS 


MyoD family inhibitor 


A A 


42yyz2 zy/twu hs.22oii/ 


.SSJM.Iinker histone.7tm_1 


HI histone family, member 0 


A 1 


447178 AWo94d41 nS.19241^ 


.SS.TM 


ESTs 


A n 

4.U 


>monoo TftT/on i_Jr> cnnno 
4UyU38 T9749U nS.5UuU2 


SS.ILB.SS.ILS 


small indudble cytokine subfamily A (Cy 


A n 

4.U 


452/47 DC 153855 HS.oi4bU 


.SS.HLH 


}g superfamily receptor LNIR 


A n 

4.U 


420139 NMJ005357ns.S535i 


.SSJM.P450, 


lipase, hormone-sensitive 


A n 
4.0 


>maQ77 AAii7an9o Um 'iQno4C 
40ao77 AA479U03 nS.13U315 


,SS.TM 


ESTs, Weakly similar to A47582 B-cell gr 


A n 

4.U 


403329 NA 


SS.SS 


Target Exon 


A n 
4.U 


JlOOnOC AlAm4il07C LJ« 4 97/1A7 

439926 AW014875 HS.1 37007 


SS 


ESTs 


A n 
4.U 


430832 A1073913 nS.1006ob 


SS 


ESTs. Weakly similar to JE0350 Anterior 


A n 
4.U 


A*>'%AOA A\htAC4CAC Ua 4e'tcnil 

432461 AW451645 nS.15i504 


.SS.CoiIagen.COLFI.TSPN. 


Homo sapiens cDNA FLJ11973 fis, done HE 


A n 

4.U 


AC044A Al 4O0C4A 00009 

452410 AL133o19 riS.29383 


,SSJM,ras 


Homo sapiens mRNA; cONA DKFZp434E2321 (f 


A f\ 
4.U 


418661 NM_001949Hs.11o9 


SS 


E2F transcription {actor3 


A A 

4.U 


404AJrO VCOCOA I 1- 0077 

431958 X63629 nS.Zo77 


SS,TM,Cadherin_CJemi.cadherin,SS,TM,cadcadherai 3, type 1, P-cadherin (placenta 


A n 
4.U 


425071 N!vi_013989nS.1 54424 


SSJ4_deiodinase,T4_deiodmase. 


deiodinase. iodothyronine, type 11 


4.U 


447197 R36075 


.TM,SDF. 


gb:yh88b01 ^1 Soares placenta Nb2HP Homo 


A A 

4.U 


428722 U7b45D Hs.1 90787 


.SSJIMP, 


tissue inhibitor of metalloproteinase 4 


3.9 


428330 L22524 Hs.2256 


SS,Peplidase_M10,SS.Peptidase_M10,hemopematnx metalloproteinase 7 (MMP7; uterin 


3.9 


*\ZdcHc ALUoy4U2 nS.l2o/oo 


SS 


DEME-6 protein 


Q 


449048 Z4b051 nS.22y20 


SS.SSJM 


similar to S66401 (cattle) glucose indue 


3.y 


A4>IQ04 hJl044CO LJ^ It A'iCi 

414831 iVi3n58 ns. 77439 


,SS.cNMP_binding.Rlla,HMG_box 


protein kinase, cAMP-dependent. regulato 


O Q 

3.9 


413509 AVv4o2b3l nS.313oU3 


.SS.abhydrolase 


ESTs. Highly similar to AF1 57833 1 nond 


0.0 


4UOO/0 iN{vi_Ulb434nS.40bU4 


SS 


DKFZP434B168 protein 


1 ft 
0.0 


>I40C0Q DOy17CCA LJ^ OCQCO 

418b2y Dt247550 nS.8o65y 


SS.SH2,PH.SS.TM,PH,SH2,Furin-like.pkinas growth factor receptor-bound protein 7 


O fl 

0.0 • 


450 /o/ AoUUbiyU nS,254^0 


SSJM.MIP.SS.TM.MIP, 


aquaporin 7 


'i ft 

0.0 


4 140/ U IN/ZZCJ nS.oUUDrU 


SS 


KIAA1204 protein 


ft 
o.o 


4503^5 Aiy3oyb2 nS.2b2oy 


SS 


ESTs 


ft 
o.o 


40rDo3 Nivj_UU/UgynS.3f Toy 


TM,TM 


similar to rat HREV1 07 


'I ft 
0.0 


A0ft170 A A 171*^07 Wc 19'>A*\R 


.SS.DENN 


ESTs 




44ZZ0Z DC 1 r UOOl MS.O/ uU 


.SS.START. 


deleted in liver cancer 1 


0.0 


427961 AW293165 Hs.143134 


SS 


ESTs 


3.8 


445563 AW873606 Hs!l49006 


.SS.WHI.WHI 


ESTs 


3.8 


403943 


p450.SS.p450 


C5000355:gi|4503225|reflNP.000765.1| cyt 


3.8 


408761 AA057264 Hs.238936 


.SSJM,7tm.1, 


ESTs. Weakly similar to (deffine not ava 


3.8 


423279 AW959861 Hs.290943 


SS 


ESTs 


•3.8 


420440 NM_002407Hs.97644 


,SS.SRCR.UterDglobin 


mammaglobin 2 (MGB2; mammaglobin B; lip 


3.8 


445107 AI208121 Hs.147313 


.SS.TM 


ESTs. Weakry similar to 138022 hypotheti 


3.7 


428303 AW974476 Hs.183601 


SS.RGSJ^GS.RGS 


regulator of 6-protein signalfing 16 


3.7 


411667 8E160198 


TM 


gb:QV1-HT0413^1020&O59-h03 HT0413 Homo 


37 


427809 M26380 Hs.180878 


,SSjipase,PLAT.Sec7,PH. 


fipoppotein lipase 


3.7 


418203 X54942 Hs.83758 


CKS.SS.CKS, 


CDC28 protein kinase 2 


3.7 


430376 AW292053 Hs.12532 


SS 


chromosome 1 open reading frame 21 


3.7 


444190 A1878918 Hs.10526 


SS 


cysteine and glydne-rich protein 2 


37 


433495 AW373784 Hs.71 


SS.ig.MHCj,connexin.SCAN.SS,TM 


alpha-2-glycoprotein 1. zinc 


37 


429638 AI916662 Hs-211577 


SSJM,SS 


kinedin 1 (kinesin receptor) 


37 
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454071 

451859 

420281 

427691 

428824 

424676 

418026 

457465 

417601 

407999 

425548 

446619 

411213 

405525 

417511 

428769 

407137 

401866 

451195 

426044 

426310 

440029 

408573 

431830 

444781 

431493 

414175 

411789 

418851 

453968 

407104 

449051 

434398 

454042 

459496 

414998. 

407756 

442101 

449722 

452554 

421991 

420058 

425776 

407846 

406925 

445873 

418054 

414921 

452268 

428862 

412520 

410530 

401780 

447131 

418334 

415138 

421168 

431473 

421379 

411984 

408101 

405366 

414612 

411393 

435767 

416406 

433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW194426 Hs^0726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM_014735Hs.82292 
AI126271 Hs.49433 



AA890023 

AU076643 

AA676939 

Y13647 

AL049176 



Hs.1906 
Hs.313 
Hs.69285 
Hs.1 19597 
Hs.82223 



AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NM.000909HS.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400Hs.11950 
AI791493 Hs.129873 



■AI308876 

AF245505 

A1417828 

AA847843 

S57296 



Hs.103849 
HS72157 
Hs. 192435 
Hs.62711 
Hs.323910 



AW961400 Hs.333526 
AA121098 Hs,3838 
H22570 Hs.172572 
AA808940 Hs.274450 
.NM.002543HS.77729 
AA1 16021 Hs.38260 
A1651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM.014918HS.1 10488 
AK001423 Hs.94694 



U25128 
AA4262Q2 
L34041 
AA250970 



Hs.15949g 
Hs.40403 
Hs.9739 
Hs.25ig46 



NM^002318Hs.83354 
BE390551 Hs.77628 
NNL003512HS.28777 
NM_000346Hs^316 
AA442324 Hs,795 
M25809 Hs.64173 

NM_004585Hs.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM_005419Hs72988 
AW968504 Hs.123073 

BE274552 Hs76578 
AW797437 Hs,69771 
H73505 Hs.117874 
086961 Hs.79299 
NM.006456HS288215 



JM.7tmJ, ■ 

.SS,TM.EGF.tdLrecepLa,l(lLrecepLb.EGF 

,SS.GIycos_transL2, 

SS 

Glycos_transL2.Rictn_BJectin.SS,Glyoos 

lipocaUn.SSJlpocarinJipocaiin.fernti 

.SS.Pribosyltran.Sulfatase 

PHD.pkinase,SS 

SS 

SS.TM.fn3,SS.TM.fn3. 
,SSJM,efhand,ion_trans 



SSJM.CUB,F5^F8^type_C.MAM.SSJM.CUB,F5_neuropilin1 



ESTs 

ESTs. Weakly similar to ALU1J1UMAN ALU S 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:poIyp 

fatty acid binding protein 4, adipocyte 

DMA replication factor 

KIAA0215 gene product 

ESTs. Weakly similar to YZ28^HUMAN HYPOT 

prolactin receptor 

secreted phosphoprotein 1 (osteopontin. 



SS.TM.OesatuFase.SS 

SS 

.SS,7tm_1.SPRY. 

,SS.TM.GDA1_CD39 

»SS.fUament 

SS.itomeobox.Ets,SS.homeobox, 
SS 

SSJM7tm_1. 
SS 

.SS.TM.PMP22_Glaudin, 
,SS.TM.IL8 
,SS,PH,lactamase„B 
,SS.p450,p450 



stearoyj-CoA desaturase (delta-9-desatur 

chordin-like 
ESTs 

gb:ye53h05.sl Scares fetal liver spleen 
Target Exon 

mesenchyme homeo box 1 
ESTs 

neuropeptide Y receptor Y1 
ESTs. Weakly similar to S64329 probable 
ESTs 

small inducible cytokine subfomify A (Cy 
GPI-anchored metastasis-associated prote 
ESTs. Weakly similar to A36036 cytochrom 
.TM,hemopexin,Peptidase_M10,hemopexin,Pehypolhet]cal protein DKFZp761D112 
ig,LRRCT. DKF2P564I1 922 protein 

.SS.TM ESTs 

.SS.HMG_box. Homo sapiens, clone IMAGE:3351295. mRNA 

,SS,TM,SH2,PH.pkinase.Recep_L_domain,Furv-erb-b2 avian erythroblastic leukemia v 



SS 

pkinase,POLO_box,SS,pkinase,POLO_box, 
.SS,lg,pkinase,LRRNT.LRRCT. 
.SS.TM.KRAB.SCAN.zf-C2H2.ig 
.SS.TM 

SS.UCH-1 .UCH-2.SSJM.G jlu_transpept 
SS 

cycIin,SS.TM.cyclin, 
SS 
SS 
SS 

SSJM.7tm^,SS,TM,7tm_2 

,TM,ABC_niembrane,ABC.tran,RibosomaLS4eCbp/p300-interacting transactivator, wtt 
,SS,TM.transportj)rot,SWiB.RhoGAP.DAG.PEglycerol-3-phosphate dehydrogenase 1 (so 
.SS,rmn.PABP,pkinase.14-3-3.rnn poly(A)-binding protein, cytoplasmic 1-1 

.SS.TM.mito_carr.Lysyt_oxkiase lysyl oxidase-Iike 2 

SS,START,SS,START.NNMT_PNMT_TEMT, steroidogenic acute regulatory protein r 
SS,histDne,CaIc_CGRP_IAPP,lg,MHCJ,SPRY»H2A histone family, member L 



HER2 receptor tyrosine kinase (c-erb-b2, 
senjm-inducibie kinase 
hypothetical protein FLJ20093 
EST 

oxidised low density lipoprotein (lectin 
ubiquitin specific protease 18 
ESTs 
cyclin B1 

ESTs. Weakly similar to ALU5_HUMAN ALU S 
KIAA0990 protein 

Homo sapiens cDNA FIJ10561 fis, clone NT 
parathyroid hormone receptor 2 



SS.HMG.box. 
histone.SS.histone.BoIA 
ATP-synLab.SS,7tm_t .ATP-synLab 
filamentSS.filament.filament 
TM 

,SS,TM.RibosomaLL27e. 
,Kunitz_BPTI. • 
SS.p450.SS 
SS 

SS.1L8. 

SH2,STAT,SS,STAT 
.pkinase. 

RhoGEF.PH,SS.RhoGEFPH. 

SAP,SS.FG-GAP.vwa 

SS.sushi,trypsin.vwa.rnn.fibrinogej>_C.fn 

.SS,Peptidase_S8,P,Peptidase_S8,P 

.SS.TM 

,SS.Pribosyltran. 
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SRY (sex delemiining region Y)-box 9 (ca 
H2A histone family, member O 
ATPase. H transporting, lysosomal (vacuo 
NM_005557^Homo sapiens keratin 16 (foca 
retinoic add receptor responder (tazaro 
ESTs 

tissue factor pathway inhibitor 2 
cytochrome P450, subfamily lIB (phenobar 
ESTs. WeaWy similar to S65824 reverse t 
small inducible cytokine subfamily B (Cy 
signal transducer and activator of trans 
CDC2-related protein kir^e 7 
NM_003371*:Homo sapiens vav 2 onoc^ene ( 
protein Inhibitor of activated STAT3 
B4actor» properdin 
ESTs 

lipoma HMGtC fusion partner-fike 2 
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Hs.21342 
Hs,339565 
Hs.1 72914 



Hs.164680. 
Hs.280380 
Hs.134981 



Hs.57664 . 
Hs.133483 



Hs.22785 
Hs.269109 



445462 AA378776 Hs.288649 
439452 AA918317 Hs.57987 
452017 AF109302 Hs.27495 
409099 AK000725 Hs.50579 
452106 AI141031 
447519 U46258 
428928 AF037082 
438825 BE327427 Hs.79953 
414575 H11257 Hs.22968 
417837 AL079905 Hs.1103 
422128 AW881145 
445941 A1267371 Hs.172636 
429973 AI423317 
444542 AI161293 
459561 Ai547306 
425741 AF052152 Hs.159412 
426501 AW043782 Hs.293616 
456508 AA502764 Hs.123469 
434228 Z42047 Hs.283978 
415752 BE314524 Hs.78776 
400419 AF084545 
439750 AL359053 
423858 AL1 37326 
428514 AW236861 Hs.1 931 39 
428698 AA852773 Hs.334838 
448988 Y09763 
432072 N62937 
417433 BE270266 Hs.82128 
452194 AI694413 Hs.332649 
444051 N48373 Hs,10247 
420042 AW015140 Hs.161723 
457292 A1921270 Hs.334882 
421458 NM_003654Hs,1 04576 
431104 AW970859 Hs.313503 
443767 BE562136 Hs.9736 
419589 AW973708 Hs.201925 
416447 Z97171 Hs.78454 
443464 BE548446 Hs.5167 
423431 AA326052 
413276 BE563085 Hs.833 
458451 AW297181 Hs.195922 
440449 AA885430 Hs.201925 
413753 U17760 
434876 AF160477 
435575 AF213457 
415773 R21651 
446440 AV658411 
450847 NM_003155Hs.25590 
426075 AW513691 Hs.270149 
452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
402837 NA 

439451 AF086270 Hs.278554 
405664 L34041 Hs.9739 
417315 A1080042 Hs.336901 
413011 AW068115 Hs.821 
414987 AA524394 Hs.294022 
429197 H24471 Hs.26930 
448030 N30714 Hs.325960 
407604 AW191962 Hs.249239 
419092 J05581 Hs.89603 
456672 AK002016 Hs.1 14727 
443171 BE281128 Hs.9030 
452256 AK000933 Hs.28661 
432201 Ai538613 
406642 AJ245210 
400903 NA 



HS75517 

Hs.61460 

Hs.44234 

Hs.324725 

Hs.42656 



SS.SS 

SS.SS 

SS 

SS 

SS 

SS 

,SS,adh_short.TGF-beta,TGFbj)ropeptide 

,SS.TM,histoneANF_receplor.guanylate_cy 

.SS.pkinase.ig, 

SS,TGFb_prDpeplide,TGF-beta,SS 
SS 

SS.SSjipoxygenase.PLAT 
,SSJ-box,UDPGT 

SS.SS.Pepfidase_M1,EGF.igJectin_c,sushi 
SS 

(pkinase. 

SS 

SS 

,SS.TM.7tm_1 
TM 

,SS.Peptidase_Mt 

,TM.integrin.B,Ricin_BJecOn.rrm 

.SS.TM 

.SS,START.NNMT_PNMr^TEMT. 

SS 

SS.TM.TM 
,Sema,ig, 

SS,TM.LRRCT.LRRNT,LRR.TM.LRRCT. 

,SSJM.7tm_3.ANF_feceptor,sushi 

.SS.ig, 

.SS.CUB, 

SS,TM.SS,TM,G-patch 

SS 

.Semajg, 

,SS.PCI,RasGEF»hormone_fec,zf-C4, 
.FGF, 

SS.OLF,OLF,OLF,RibosomaLL4 

SS.TM,SSF.SS.TM 

.SS,p450,p450 

.SSJM.ubic|uitin.laminin_G,laminin_EGF.k 

.SS.Ribosomal_L14 

.FGF. 

SS,laminin_EGF,laminin_Ntenn.adh_short,S 

,SS,HLH 
SS.ig.SSJM 

,SS,TM,RibosomaLS3Ae,G-gamma 
SS 

.SS.homeobox, 
.SS,fn3. 

.SS,TM.Activiiurecp,pkinase 
,TM.p450.Ets 

SS 

,SS,Chromo_shadow,chromo, 



hypothetical protein MGC3077 

B-cell CLUlyrhphoma 1 1B (zinc finger pro 

prostate cancer assodated protein 7 

hypothetical protein FU20718 

ESTs 

ESTs 

retinot dehydrogenase 5 (11-cis and 9-cl 
ESTs 

Homo sapiens clone IMAGE:451939, mRNA se 
transforming growth factor, beta 1 
gb:QV0-OT0033^10400.182-a07 OT0D33 Homo 
ESTs 
ESTs 

aminopeptidase 
ESTs 

Homo sapiens clone 24628 mRNA sequence 
ESTs 

ESTs. Weakly similar to AF208855 1 BM-01 
Homo sapiens PR02751 mRNA. complete cds 
putative transmembrane protein 
Target 

Homo sapiens mRNA full length insert cDN 
Homo sapiens mRNA; cDNA DKFZp434B0650 (f 
ESTs 

KIAA1 866 protein 

gamma-amlnobutyric add (GABA) A recepto 
ESTs 

5T4 oncofetal trophoblast glycoprotein 
olfadory receptor, famiHy 2, subfamily 
activated leucocyte cell adhesion molecu 
ESTs 

hypothetical protein FLJ 14251 
carbohydrate (keratan sulfate Gal-6) sul 
ESTs 

proteasome (prosome, macropatn) 26S subu 
Homo sapiens cDNA FU13446 fis. done PL 
myodlin, trabecular meshwork indudble 
Homo sapiens mRNA; cDNA DKFZp434F152 (fr 
gb:EST29171 Cerebellum 11 Homo sapiens c 
interferon-stimulated protein. 15 kDa 
ESTs 

Homo sapiens cDNA FU 13446 fis, clone PL 
laminin. beta 3 (nicein (125kD). kalinin 
Ig superfamily receptor LNIR 
triggering receptor expressed on myetoid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocaldn 1 

ESTs, Weakly similar to 2109260A B cell 
Homo sapiens cDNA FU11309 fis. clone PL 
platelet-activating factor acetylhydrola 
ENSP00000241312*:DJ947L8.1.8 (novel Sush 
heterochromatin-like protein 1 



,SS,TM.transporU)rot,SWIB,RhoGAP.DAG__PEglycerol-3-phosphate dehydrogenase 1 (so 
,SS.RNA_poLA,RNA_poLA2.RibosoiTiaLS24e.ribosomal protein 824 



Hs.298241 



SS.LRR,LRRNT,SS.LRRNT.LRR. 
,SS,connexin,hormone_rep,zf-C4.connexin 
.SS.Gelsolin. 
,SS,TM 
.SS.TM.C1q, 
SS.TM,SEA, 
,SS.PK.PK 
SSJM.7tm_1.rTm,SS 
.TM.GDl,7tm_l, 

SSJ^yltrypsin,SS.T^/l.trefoil.trypsHl.t^ef 

SS 

SS 
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bigtycan 
hypothetical protein FLJ 14950 
ESTs. Weakty similar to T20272 hypotheti 
membrane>spanning 4-domains, subfamily A 
collagen, type Vlli, alpha 2 
mucin 1. transmembrane 
Homo sapiens, done MGC:16327. mRNA. com 
TONDU 

Homo sapiens cDNA FLi10071 fis. done HE 
Transmembrane protease, serine 3 
gb:Homo sapiens mRNA for {mmunoglbbulin 
Target Exon 



3,3 

3.3 

3.3 

3,3 

3.3 

3.3 

3.3 

3.3 

3,3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3,3 

3.3 

3.3 

3.3 

3.3 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3.2 

3,2 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 
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434408 AI031771 Hs.132586 .SS.G!yco.hydrD_2 ESTs 3.1 

452994 AW962597 Hs.31305 SS,WD40.SS.WD40, KIAA1547 protein 3.1 

445903 AI347487 Hs.132781 fn3,SS,TM.EF1BD dass I cytokine receptor 3.1 

424364 AW383226 Hs.201189 SS ESTs, Weakly similar to G01763atrophln- 3.1 

410196 AI936442 Hs.59838 UBACT^repeat.SS,UBACT_repeat,ThFJamilyhypothetical protein FU10808 3.1 

419150 T29618 Hs.89640 ,TM,pkinase,fh3. TEK tyrosine kinase, endothelial (venous 3.1 

433417 AA587773 Hs.8859 ,SS,SRCR, Homo sapiens. Similar to RIKEN cDNA 5830 3.1 

418624 AI734080 Hs.104211 .Sema.ig. ESTs 3.1 

436291 BE568452 Hs.5101 .SS.abhydroiase, protein regulator of cytokinesis 1 3.1 

411000 N40449 Hs.201619 SS ESTs. Weakly similar to S38383 SEB4B pro 3.1 

412519 AA196241 Hs.73980 SS.Troponin,Hemagglutinin.SS,TM,C2.Tropo troponin T1, skeletal, slow 3.1 

450223 AA418204 Hs.241493 .SS.proJsomerase, natural killer-tumor recognition sequenc 3.1 

422790 AA809875 Hs.25933 ,TM.histone,Sec1.histone^ugar.tr ESTs 3.1 

424269 AW137691 Hs.199754 .SS,TM.7tm_2.GPS ESTs 3.1 

435854 AJ278120 Hs.4996 ,SS.WD40 putative ankyrin-repeat containing prote 3.1 

447388 AW630534 Hs.76277 ,SS,TM.rrm,oxidoredjq6,Qxidored.q6 Homo sapiens, done MGC:9381. mRNA. comp 3.1 

451631 R00866 SS gb:ye79c02.s1 Soares fetal Hver spleen 3.0 

448105 AW591433 Hs.298241 .SS,TM.trefbiLtrypsln.trefbil Transmembrane protease, serine 3 3.0 

438637 6E500941 Hs.126730 ESTs, Weakly similar to KIAA1214 protein 3.0 

423024 AA593731 Hs.325823 .SS,TM,CD36 ESTs. Moderately similar to ALU5.HUMAN A 3.0 

456592 R91600 .SS.Ran3P1.LIM,Ran_BP1,GRIP,TPR,UM gb:yq10c02.r1 Scares fetal Rver spleen 3.0 

425920 AL049977 Hs.162209 SS.TM.SS.TM.PMP2^.Claudin.PMP22_Claudindaudin 8 3.0 

444670 H58373 Hs.332938 .SS.TM hypotheUcal protein MGC5370 3.0 

401197 arf.arf. ENSP00000229263*:HSPC213. 3.0 

437755 AW204256 Hs.291887 ,wnt ESTs 3.0 

452560 BE077084 Hs.335432 ,SS,fTm.2f-RanBP.pkinase,G2.pkinase_C,DA ESTs 3.0 

410274 AA381807 Hs.61762 SS.SS hypoxia-indudble protein 2 3.0 

450098 W27249 Hs.8109 SS hypothetical protein FLJ2 1080 3.0 

404826 ,SS,TM Target Exon 3,0 

458389 H70284 Hs.160152 .SS.RA ESTs, Weakly similar to FPHU alpha-fetop 3.0 

408195 AL034548 Hs.43627 HMG_box,pkinase.2f-CCHC.SS,TM.HMG_box.SRY {sex detennining region Y)-box 22 3.0 

433575 AW977653 Hs,75319 ,SS,ribonuc_red_sm. ribonudeotide reductase M2 polypeptide 3.0 

418848 AI820961 Hs,193455 ,death,2U5.pkinase.Adivin_recp, ESTs 3.0 

422095 AI868872 Hs.282804 SS.Cu-oxidase.SS.Cu-oxidase.Cu-oxidase hypothetical protein FU22704 3.0 

415992 C05837 Hs.145807 .SS.TM hypothetical protein FU 13593 3.0 

424631 AA688021 Hs.179808 SS ESTs 3.0 

409956 AW103364 Hs.727 SS,TGF-beta,TGFb_propeptide,SS,TGF-beta.inhibin. beta A (activin A. activin AS a 3.0 

419667 AU077005 Hs.92208 SS,disintegrin,Reprolysin,Pep_M12B_prope a disinlegrin and metailoproteinase dome 3.0 

450946 AA374589 Hs.l27698 SS ESTs. Moderately similar to 21 09260A B c 3.0 

447770 AB032417 Hs.19545 f rizzled.Fz,SS,TM,Frizzled.Fz. frizzled (Drosophila) homolog 4 3.0 

423826 U20325 Hs.1707 SS.SS cocaine- and amphetamine-rBgulated trans 3.0 

418838 AW365224 Hs.35198 .SS.TM,Phosphodiest. edonudeotide pyrophosphatase/phosphodi 3.0 

442804 AW300118 Hs.131257 .SS.TM.G-gamma ESTs 3.0 

432284 AA532807 ■Hs.105822 .SS.TM.pkinase. • ESTs 3.0 
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TABLE 20A 

Table 20A shows the accession numbers for those pkeys lacking unigenelD's for Table 20. 
5 For each probeset, we have Usted the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland CaUfomia). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbeis 

15 



Pkey 


CAT number 


Accessions 


410785 


1221055 1 


AW803341 AW803265 AW803403 AW803465 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 


411667 


1253334 1 


BE160198 AW935898 T11520 AW935930 AW856073 AW861034 


418636 


177402 1 


AW749855 AA225995 AW750208 AW750206 


420854 


197072 1 


AW296927AI684514AJ263168AA281079 • 


422128 


211994 1 


AW881145 AA490716 M85637 AA304575 T05067 AA331991 


423431 


228152J 


AA326062 AA325758 AW962182 


423945 


233566J 


AA410943 AW946953 AA334202 AA332882 


426878 


273265_1 


BE069341 AW748403 AL044891 AI908240 AA393080 


433687 


373061_1 


AA743991 AA604852 AW272737 


447197 


71 1623 J 


R36075AI366546 R36167 


451631 


878098J 


RCJ0866R01523AI806815 


456207 


165078_-1 


AA1 93450 


456592 


202684_1 


R91600 T87079 AA291455 
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TABLE 20B 

Table 20B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 20. For each predicted exon, we have Usted the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



25 



30 



35 



40 



Pkey: 
Ref: 

strand: 
NLpositton: 



Pkey Ref 



Unique number corresponding to an Eos probeset 

Sequence source. The 7 d'git numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al" refers to the publtoation 

entitled 'The DNA sequence of human chromosome 22." Dunham I. et al.. Nature (1999) 402:489-495. 
Indicates DNA strand from which exons were predicted. 
Indicates nucleotide positions of predicted exons. 

Strand Nt_posit1on 



400608 9887666 Minus 96756-97558 

400903 2911732 Plus 59112-59228 

401045 8117619 Plus 90044-90184,91111-91345 

401093 8516137 Minus 22335-23166 

401197 9719705 Plus 176341-176452 

401747 9789672 Minus 1 18595-1 18816,1 191 19-1 19244.1 19609-1 19761 .120422-120990.130161-130381,130468-130593,131097-131258.131866- 

131932.132451-132575.133580-134011 

401780 7249190 Minus 28397-28617,28920-29045.29135-29296.2941 1-29567.29705-29787,30224-30573 

401761 7249190 Minus 83215-83435,83531-83656.83740-83901,84237-84393.84955-85037.86290-86814 

401866 8018106 Plus 73126-73623 

402230 9966312 Minus 29782-29932 

402408 9796239 Minus 110326-110491 

402578 9884928 Plus 66350-66496 

402605 9909429 Minus 81747-82094 

402837 9369121 Minus 2013-2186.9570-9758,11136.11309,19429-19677.21210-21455.23368-23562.24342-24527.29132-29320 

403199 9958183 Minus 58895-59036.66618-66789 

403329 8516120 Plus 96450-96598 

403593 6862650 Minus 62554-62712,69449-69602 

403943 7711864 Plus 100742-100904.101322-101503 

404091 7684554 Minus 82121-83229 

404347 9838195 Plus 74493-74829 

404826 6572184 Plus 47726-48046 

405366 2182280 Plus 22478-22632 

405654 4895155 Minus 53624-53759 
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TABLE 21: 210 GENES UP-REGUIATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS 
AMENABLE TO MODULATION BY SMALL MOLECULES 



10 



15 



Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode either enzymes or proteins amenable to modulation by small 
molecules. These were selected as for Table 19, except that the ratio was greater than or equal 



to 3.0, the "average" normal adult tissue level was set to the 85 percentile value amongst 
144 non-malignant tissues, and the 96"^ percentile value amongst 73 breast cancers was 
greater than or equal 80 units, and the predicted protein contained a structural domain that is 
indicative of enzymatic fimction or of being modulatable by small molecules (e.g. pkinase, 
peptidase, phosphatase, ATPase, or ion_transporter domains). The predicted protein domains 
are noted. 



20 



25 



30 



35 



40 



45 



50 



55 



Pkey: Unique Eos probeset tdentirier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene We: Unigene gene titie 

R1 : Ratio of 93rd percentile tumor to percentite of nomial body 



Pkey ExAccn UnigenelD Predicted Protein Domains 



449746 
400292 
424735 
407178 
408045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 
456207 
456938 
402578 
425692 
424001 
418007 
421727 
411869 
400289 
443348 
424086 
400295 
408771 
421155 
-424905 
438167 
459583 
423945 
445263 



AI668594 Hs,176588 
AA250737 Hs.72472 
U31875 Hs.272499 
AA195551 Hs.104106 
AW138959 Hs.245123 
AA009647 Hs.8850 
NM_001394Hs.2359 
A1624342 Hs.170O42 
NM_003513Hs.151407 
X78592 Hs.99915 
AI905687 Hs.2533 
AI375572 Hs.172634 
AA1 93450 

X52509 Hs.161640 



D90041 
W67883 
M13509 
Y13153 



Hs.1 55956 
Hs.137476 
Hs.83169 
Hs.107318 



W20027 Hs.23439 
X07820 Hs.2258 
AW873596 Hs.182278 
A)351010 Hs.102267 
W72838 Hs.2533 
AW732573 Hs.47584 
H87879 Hs.102267 
NI^002497Hs.153704 
R28363 Hs.24286 
AI907673 
AA410943 

H57646 Hs.42586 



SS.p450 

death,ZU5,TM^ctivin_req).pkinase, 
SS.TM 

SS,DihydrDon)tase. 
Phosphodiest,Somatomedin_B, 

SS,TM,disintegrin,Pep_M12B_propep,Reprol a disintegrin and metalloproleinase doma 



UnlgeneTitle 

ESTs. Weakly similar to CP4Y.HUMAN CYTOC 
B^/IP-R1B 

shoit-chan alcohol dehydrogenase family 
ESTs 
ESTs 



DSPc.Rhodanese, 
SS,TM.Cation_efflux 
ig,tsp_1,SS.AAA 

homnone_rec,Androgen_rBcep.zf-C4, 
SS 

pkinase, 

SS.TM.p450,p450 



dual spedTicily phosphatase 4 
ESTs 

cartilage intemiediale layer protein, nu 
androgen receptor (dihydrotestosterone r 
aldehyde dehydrogenase 9 family, member 
ESTs 

gb2r40e07.r1 Soares„NhHMPu_S1 Homosapi 



SS,TM,aminotran_1_2,Cadherin_CJerm,cadhtyrDsine aminotransferase 
SS,p450.SS,TM.p450 C1001134:gi|21l7372lpir||l65981 fatty ac 

SSAcetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

pkinase, paternally expressed 10 

SS,hemopexin,Pept{dase_M10.SS.Peptidase_matrix metafloproteinase 1 (MMPI; inters 

kynurenine 3-monooxygenase (kynurenirie 3 
SS,Peptidase_M1. ESTs 

hemopexin.PeptkJase_M10.SSPeptidase_M10matrix metaltoproteinase 10 (MMP10: str 
SS.DENN calmodulin 2 (phosphorytase kinase, delt 

SS.LysyLoxidase lysyl oxidase 

SS aldehyde dehydrogenase 9 femily, member 

TM.K.tetFa.ion.trans.SS,TM,K.tetia;ion_t potassium voltage-gated channel, delayed 
SS.Lysyt_oxtdase,A(dose_epim.Epimerase.S lysyl oxidase 
pkinase.SS,TIM.pkinase,potyprenyt_synt NIMA (never In mitosis gene a)-related k 



SS.TM,7tm_1,p450.nin 
pkinase, 

death,ZU5,TM,Acthrin_recp.pkmase. 
SS^AcyitFansfeiase, 



ESTs 

gb:IL-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03.r1 Scares ovary tumor NbHOT H 
KlAAISeO protein 



R1 

65.7 
55.9 
53.8 
39.3 
34.9 
257 
24.9 
24.1 
21.7 
21.7 
20.3 
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5 
13.0 
12.B 
12.7 
11.9 
11.8 
11.7 
115 
11.5 
11.4 
11.2 
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60 



65 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 
A0S3BQ 
409203 
424902 
431725 
418092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449650 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs.134585 
NM_007050Hs.225952 
BE440042 Hs.83326 
NM_000685Hs.89472 
A1120173 Hs.301663 
NA 

AI733881 
M31126 
AB041035 
NA 

AF123050 
AA780473 



Hs.72472 

Hs^72620 

Hs.93847 

Hs.44532 
Hs.687 



SS,TM.BRCT.ajiMBC„lran^BC_tran DKFZP434G232 protein 
SS,TM,Y_phosphatase,MAM,fn3. protein tyrosine phosphatase, receptor t 

SS.Peptidase_M10,hemop8xin,SS,Peptidase_matrix metalloprotetnase 3 (strometysin 
SS.TMJtm J ,SS.TM,7tn{_1 , angiotensin receptor 1 

SS,pkinase, ESTs 

SS.carb.anhydrase NI\A_030920*:Hoino sapiens hypothetical pio 

death^U5JM,Activin_repp<pkinase, BMP-R1 B 

SS,Peptidase_M10,hemopexin.SS,Peptjdase_matrix metalloproteinase 11 (MiVIP11; stro 



NM-.003866HS.153687 
X65724 Hs.2839 
R45154 Hs.106604 
M81057 Hs.180884 
M90516. Hs.1674 
AW242243 Hs.168670 
NM_001034Hs75319 
Niy/L004482Hs.278611 
038299 Hs.170917 
AI240665 Hs.8895 



Ferric_reduct,TM,Ferric_reduct 
TM,ABC_tranABC_membrane. 
SS,TM,ubiquftin,7fcrrL.3,ANF_feoeptor,sushi 
SS.p450,SS.p450 
SS,SS 

SS.Cys^knotSS 
death7U5.pkinase.Activin_recp, 



NM_016931:Homo sapiens NADPH oxidase 4 ( 
Eos Control 
diubiquitin 

cytochrome P450, subfamily IVB, polypept 
inositol polyphosphate-4-phosphatase, ty 
Norrie disease (pseudoglionna) 
ESTs 



Y12735 
U37519 
'U92649 



Hs.38018 
Hs,87539 
Hs.64311 



AW204099 Hs.337720 
AP000892 Hs.129781 



C1 1001883*:gi|6753278|reflNP^033938.1I c 



SS^_carbOpept.Propep.M14,SS.Propep_M14carfoQxypeptidase 61 (tissue) 
GATase_2,SIS. glutamine-fructose^-phosphate transamin 

SSJM,WO40.ub'iquitin. El -E2^TPase.Cation peroxisomal famesylated protein 
SS ribonudeofide reductase M2 polypeptide 

SSJM.Glycos_tFansL2.Ricin_B.iectin.DPP UDP-N-acetyl-atpha-D-galactosamine:polyp 
SS,TM,7tm_1 , prostaglandin E receptor 3 (subtype EP3) 

SS.TM.disintegrin.Pep_M12B^n)pep.Reprol ESTs 

pidnase. dual-specificity tyrosine-(YVphosphoryl 

SS,aldedh.SS,a1dedh. aldehyde dehydrogenase 3 family, member 

TM,disinlegriaReprolysin, a disintegrin and metalloproteinase doma 

ESTs. Wealdy similar to AF126780 1 retin 
GAF.PDEase chromosome 21 open reading frame 5 

lCEj)20.SS.lCE_p10,ICE_p20,ICE^10.!CE_p 



Hs.7413 
Hs.161712 



R41823 
A16554g9 
NA 

A1248013 Hs.106532 
m 002914HS.139226 
AF055575 Hs.23838 
AA932186 Hs.69297 
NM_005025Hs.78589 
AW167087 Hs.131562 
R45503 Hs.97469 
Y00272 Hs.184572 
X03363 

NM.012093HS.18268 
NA 

J05070 Hs.151738 
AI572739 Hs.195471 
X81334 Hs.2936 
NA 

NM^005940Hs.155324 
AA828246 Hs.291884 



TIVl,EPHJbd.pkinase.SAM.fn3. 
TM,Actjvin_recp.pkinase,dealh,ZU5, 
TM,7tm_3,ANF_receptor, 
2f-C2H2 

SS,AAA.ViraLhelicase1 ,rrm, 

TM,ion_trans,SSJM»ion_trans, 

TM.7tmJ, 

SS.serpin. 

SS.ig.Sema.pkinase, 
SS.TM 

SS,pkinase,pkinase 



6.7 

ESTs; caIsyntenin-2 
ESTs 

Target Exon 

ESTs, Weakiy similar to 138588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel. voltag&Kiependent, L ty 
ESTs 

serine (or cysteine) proteinase inhibito 
ESTs 

ESTs, Highly similar to A39769 N-acetyll 
cell division cycle 2. G1 to S and G2 to 



AF086120 

A1806867 

N54926 

M31659 

AL1 17406 

X52730 



Hs.102793 

Hs.126594 

Hs.29202 

Hs.180408 

Hs.200102 

Hs.1892 



phenyiethanoiamine N-melhyltransferase 



SS,TM,pkinase,Recep_L_domain.SH2.PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS.adenylatekinase, adenylate kinase 5 

SS.TM,p450, Target Exon 

SS,Peptidase_M10,fn2,hemopexin,SSJM.Pepmatftx metalloproteinase 9 (gelatinase B 
6PF2K.PGAM, 6'phosphofructo-2-kinasemuctose-2,6-bl 
SS.Peptldase_M10,hemopexin,SS,Peptidase.matrix metaltoprotelnase 13 (collagenase 
SS.TM.ABC^tranABC.membrane.SS C16000922:gil7499103|plr|rr20903 hypothe 
SS,Peptidase_f^1 0,hemopexin,SS matrix metalloproteinase 1 1 (MMP1 1 ; stro 

UCH-1 ,pkinase.OPR.Rhodanese,Aiy4P-binding,ESTs 
SS,TM,UDPGT.casein_kappa ESTs 
SS.TM,Phosphodiest, ESTs 
TM,7tm_lJM G protein-coupled receptor 34 

SS solute carrier family 25 (mitochondrial 

SS.TM,ABC_tran.ABC_membrane, ATP-binding cassette transporter MRP8 

SS.NNMT_PNMTjrEMT.SS,NNMT_PNMTJ"EMT,STAR 



5.1 



AW291095 Hs.21814 
NA . 

AA564991 Hs.269477 
A1281848 Hs.194691 
NM_000169Hs.69089 
W88559 Hs.1787 
U80034 Hs.68583 
NM.00446DHs.418 
AA641836 Hs.30085 
M73700 Hs.105938 
AW057736 Hs.323910 



SS.TM.pldnase, interleukin 20 receptor, alpha 

C1DE-N,pkinase Target Exon 

alpha-amylase, ESTs 
SS,TM,7tm_3,RibosomaLL1 3 retinoic add induced 3 

SS,tVlelibiase.BTK.PH,pkinase,SH2.SH3,Ribo galactosldase. alpha 
TM,ion_trans,KJetra. proteoBpid protein 1 (Pelizaeus-Merzbac 

Peptidase_M3. mitochondrial intermediate peptidase 

SS.DPPIV.NJerin.Peptidase_S9,SS.DPPlV_N Jbroblast activation protein, alpha 
SS.tiypsin hypothetical protein FU23186 

SS,transferrin.7tm_1 .transfenin, lactotransfenin 
SS,TM,pkinase.Recep_L_domain.SH2.PH,FuriHER2 receptor tyrosine kinase (c-eit>-b2. 
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10.9 
10.4 
103 
10.3 
10.3 
9.8 
9.4 
9.1 
9.1 
8.8 
8.6 
8.3 
8.3 
8.3 
83 
8.2 
8.1 
7.8 
7.6 
7.6 
7.5 
7.3 
7.2 
7.2 
. 7.1 
6.9 
6.8 



6.7 
6.6 
6.6 
6.5 
6.5 
6.4 
6,2 
6.2 
6.2 
6.1 
6.1 
6.1 
6.1 
6.1 
5.8 
5.8 
5.6 
5.6 
5.4 
5.4 
5.2 
5,2 
5-2 
5.2 
5.1 



5.1 

5.1 

5.0 

4.9 

4.9 

4.9 

4.9 

4.9 

4.9- 

4.8 

4.8 



wo 02/059377 PCT/US02/02242 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



400181 NA 

452093 AA447453 Hs.27860 
435542 AA687376 Hs.269533 
417576 AA339449 Hs,82285 
446089 AI860021 Hs.270651 
424420 BE614743 Hs.1466a8 
452190 H26735 Hs.91668 
419986 AI345455 Hs.78915 
421582 AI910275 Hs.1406 
446733 AA863360 Hs.26040 
453060 AW294092 Hs.21594 
400205 NA 
420854 AW296927 
432690 AF181490 Hs.278627 
441560 F13386 Hs7888 
416445 AL043004 Hs79337 
439024 R96696 Hs.35598 
NM 013257HS.279696 
AW073310 Hs.163533 
NM_002916Hs.35120 
Af345227 Hs.105448 
AA829286 Hs.332053 
NM_001809Hs.1594 
U38945 Hs.1174 
S70284 

AL133916 Hs.172572 
Z97630 Hs.226117 
NM_005357Hs.95351 
NM„013989Hs.154424 
BE300512 Hs.193557 
U76456 Hs.1 90787 
L22524 Hs.2256 
M31158 HSJ7439 
AW452631 Hs.313803 
BE247550 Hs-86859 
AA129640 Hs.128065 



432882 
447754 
453775 
431657 
427899 
422938 
418478 
406922 
439285 
429922 
420139 
425071 
424511 
428722 
428330 
414831 
413589 
418629 
413453 
403943 
444618 
408761 



AV653785 
AA057264 



427609 M26360 
418203 X54942 



Hs.173334 
Hs.238936 
Hs.180878 
Hs.83758 
Hs.42502 
Hs.1 51678 



454071 AI041793 
424676 Y08565 
457465 AW301344 Hs.122908 
417601 NM_014735Hs.82292 
446519 AU076643 Hs.313 
406625 Y13647 Hs.1 19597 
428769 AW207175 Hs.106771 
426310 NNL000909HS.169266 
417531 NWL003157HS.1087 
444781 NM 014400Hs.11950 
431493 AI791493 Hs.129873 
428966 AF059214 Hs.194687 
414175 AI308876 Hs,103849 
455325 AW895719 
429597 NM_003816Hs.2442 
425320 U29344 Hs,83190 
431854 AA383550 Hs.271699 
407104 S57296 Hs.323910 
449051 AW961400 Hs.333526 
434398 AA1 21098 Hs.3838 
454042 H22570 Hs.172572 
407756 AA1 16021 Hs.38260 
401464 AF039241 Hs.9028 
412970 AB026436 Hs.177534 
412049 N53437 Hs.18268 
425776 U25128 Hs,159499 



SS,TM.3Beta^HSD. ENSP00000171555:CDNA FU10727 Tis, done 

SS.TM.7tm_lT Homo sapiens mRNA; cONA DKF2p586M0723 (f 

SS.pkinase.RhoGEF,ig.PH.SH3, ESTs 

AIRS,formyLtransf,GARS.SS,GARS.AtRS.for phosphoribosylgfycinamide formy(transfer 
pkinase ESTs, Moderately similar to A47582 B-cel 

SS,TM.MAPEG. prostaglandin E synthase . 

SS,TM,PH.SH2,Furirvfike,pkinase.Recep„L_ Homo sapiens done PP1498 unknown mRNA 



pkinase.OPR. 
SS,trefoil.SS.TM,ldLrecepLa,SRCR,tryps 
SS.TM.p450. 

SS.fas.Y_phosphatase.ras 

SSJM.Peptklase_M1. 
SS.pyr_redox.SS.Ribosoma!_L39 



SS.pkinase. 
SS,TM,trypsin,vwd,lg 
pkinase.pkinase.C, 
pkinase. 



GA-bindIng protein transcription factor, 
trefoil fader 1(pS2) 
ESTs, Weakly similar to fatty add omega 
hypothetical protein MGC15754 
NlVL006265*:Homo sapiens RAD21 (S. pombe) 
gb:UI-H-BW0-al(H>07-(UJI.s1 NCI.CGAP.Su 
prenylcystelne lyase 

Homo sapiens done 23736 mRNA sequence 

KIAA0135 protein 

ESTs 

serum/glucocorticoid regulated kinase-li 
Homo sapiens cDNA FU 14142 fis. clone WK 



SSMA.PI3.PI4_kinase.PI3Ka,PI3ierbd,Pl3 replication fadorC (activator 1) 4 (37 
SS,TM,pkinase ESTs, Weakly similar to B34087 hypoiheti 

SS,SAA_proteins.ABC_membianeABCJran, semm amyloid A1 



SS.TM.thiolase. 
ank,ank 

SS,TM,Desalurase,SS 
SS.ig.pkinase,LRRNT.LRRCT, 
SS,TM,linker_histone.7tm_1 
SSJM.p450. 

SS.T4_deiodinase,T4_deiodinase. 
SS,Yj)hosphatase,Band_41 
SS.TIMP, 



centromere protein A (17kD) 
cydin-dependent kinase inhibitor 2A (me 
gb:stearoyl-CQA desaturase [liuman. adipo 
hypothetical protein FLJ20093 
HI histone family, member 0 
lipase, hormone-sensitive . 
deiodinase, iodothyronlne, type II 
ESTs, Moderately similar to ALU7_HUMAN A 
tissue inhibitor of metalloproteinase 4 



SS,Peptidase_M10,SS,Peptidase_M10,hemopematrix metalloproteinase 7 (I\/1MP7; uterin 
SS.cNMP_blnding.RlIa.HMG_box protein kinase. cAMP-dependent regulato 

SS.abhydroIase ESTs, Highly similar to AF1 57833 1 nond 

SS,SH2,PH,SS,TM,PH,SH2.Furin-!ike,pkinas growth fador receptor-bound protein 7 



ESTs 

C5000355:gi|4503225|reflNP_000765.1 1 cyt 
ELL-RELATED RNA POLYMERASE II, ELONGATIO 
ESTs, Weakly similar to (defline not ava 
lipoprotein lipase 
CDC28 protein kinase 2 
ESTs 

UDP-N-acetyl-alpha-D-gatactosamine:polyp 
DNA replication factor 
KIAA0215 gene product 
secreted phosphoprotein 1 (osteopontin. 
stearoyl-CoA desaturase (delta-d^esatur 
ESTs 

neuropeptide Y receptor Y1 
serine/threonine kinase 2 
GPl-anchored metastasis-assodated prote 
ESTs, Weakly similar to A36036 cytochrom 



SS.Peptidase_C1 ,gpdh 
p450.SS.p450 

SS.TM,7tmJ, 
SS.Iipase.PUT.Sec7.PH. 
CKS.SS,CKS, 
TM.7tm_1. 

Glycos_transf_2,Ridn_BJectin,SS.GIyoos 
SS.Pribosyltran.Sulfalase 
PHD.pkinase.SS 
SS,TM,efhand,ion_lrans 
SS,TM,Desaturase.SS 
SS,7tm_1,SPRY, 
SS.TM.7tm_1. 

SS,pkinase.vwa,vwa.Glyco.transf_8 
SS,PH,lactamase_B 
SS.p450,p450 

cholesterol 25-hydroxylase 
TM.hemopexin.Peptidase_M10.hemopexin,Pephypothetical protein DKFZp761D112 
TMJon_trans.K^tetra, gb:QV4-NN0039-290300'1 54-f06 NN0039 Homo 

a disintegrin and metalloproteinase doma * 
AcyLtransf,adh_2inc.kfitoacy^syntpp*i fatty add synthase 
IMS.SS polymerase (DNA direded) bta 

SSJM,SH2.PH,pkinase,Recep_L_domain,Fufiv-€rl>4)2 avian erythroblastic leukemia v 



SS 

pkinase.POLO_box,SS,pkinase,POLO_box. 

SS.{g.pkinase.LRRNT,LRRCT, 

S$.UCH-1,UCH-2,SS.TM,G_gltL.transpept 

Peptidase_M24, 

DSPc,Myosin_taH, 

SS,adenylatekinase, 

SS,TM,7tm^2,SS,TM.7tnv.2 
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HER2 receptor tyrosine kinase (c-erl)-b2, 
seruno-indudble kinase 
hypothetical protein FLJ20093 
ubiquitin spedTic protease 18 
histone deaoetylase 5 
dual spedfictty phosphatase 10 
adenylate kinase 5 
parathyroid homnone receptor 2 



4.6 

4.6 

4.6 

4.6 

4.6 

4.5 

4.5 

4,5 

4.5 

4.4 

4.4 

4.4 

4.3 

4.3 

4.3 

4.3 

4.3 

4.2 

4.2 

4.2 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.1 

4.0 

4.0 

4.0 

3.9 

3.9 

3.9 

3.6 

3.8 

3.8 

3.8 

3.8 

3.8 

3,7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.7 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.6 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.5 

3.4 
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407846 AA426202 HsAOAOZ 
406925 L34041 Hs.9739 
445873 AA250970 Hs^1946 
418054 NM^002318Hs.83354 
S 406815 AA833930 Hs^6036 
410530 M25809 Hs.64173 
407021 U52077 
421168 AF182277 Hs.330780 
431473 AA825686 Hs.321176 

10 408101 AW968504 Hs.123073 
422083 NM_00l141Hs.l11256 
411393 AW797437 Hs.6g771 
435767 H73505 Hs.1 17874 
433068 NM^0a6456Hs.288215 

15 426928 AF037062 Hs.172914 
414575 HI 1257 
445941. A1267371 
444542 AI1612d3 
425741 AF052152 Hs.159412 

20 434228 Z42047 Hs.283978 
433264 085782 
400419 AF084545 
439750 AL359053 Hs.57664 
417757 R19897 

25 - 452194 Aie94413 

421458 NM_003654Hs. 104576 
443767 BE562136 Hs.9736 
422648 D86983 Hs.118893 
423431 AA326062 

30 451264 AI768235 

452110 T47667 Hs.28005 
439963 AW247529 Hs.6793 
453941 U39817 Hs.36820 
406664 L34041 Hs,9739 

3 5 phosphate dehydrogenase 1 (so 
453487 R31770 Hs.23540 
420911 U77413 Hs.100293 
443171 BE281128 Hs.9030 
452256 AK000933 Hs.28661 

40 432201 AI538613 Hs,29a241 
419150 T29618 
444443 Ai149286 
426283 NIVL003937HS.169139 
436291 BE568452 Hs.5101 

45 450223 AA418204 Hs.241493 
• 424269 AW137691 Hs.199754 
448105 AW591433 Hs^8241 
4S2560 BE077084 Hs^6432 



TM,ABC_membrane,ABC_tran,RibosonBLS4eCbpyp300-interacting transactivator. wit 
SS,TM,transportj3rotSWlB,RhoGAP.DAG_PE-g!ycerol-3-phosphate dehydrogenase 1 (so 



Hs.22968 

Hs.172636 

Hs.280380 



Hs.3229 



Hs.106604 
Hs.332649 



Hs.89640 
Hs.55099 



SS,rrm,PABP,pkinase.1 4-3-3,rrm 
SS.TM,mito_canr,Lysyl_Qxidase 
SS.IPPT. 

ATP-synLab.SS,7tin.1 >VTP-synLab 

SS.p450.SS 
SS 

pkinase. 

lipoxygenase.PLAT, 

SS.sushi,trypsin,vwa.nrm,fibrinogen_C,fn 

SS.Peptldase38.P,Peptidase_S8,P 

SS,Pribosyitran. 

SS,adh_shortTGF-beta.TGFbj)ropeptide 

SS.pkinase.ig, 

SS.SS.IipoxygenasePLAT 

SS,SS,Peptkiase_M1,EGF.ig jectin_c^sht 

pkinase. 

SS.TM.7tmJ 



SS.Peptidase.M1. 
TM,integ/in_B,Rictn_BJecb*n.mn 
deathZU5.pkinaseActivin_recp, 
SSJM,7tm_3^NF.receptor,sushi 
SS 

SS,PCl,RasGEF.hormone_rec,2f-C4, 
petDxidase.LRRCT, 
SS,p450,p450 
SS.Trehalase 

SS.TM,Activin_recp,pkifiase 
TM,p450,Ets 

SS,DEAD,HRDC,helicase_C, 
SS.TM,transporLprot.SWIB,RhoGAP,DAG^PE 
3.1 

TM,7tmJ, 



poly(A)-binding protein, cytoplasmic ^A 
(ysyi oxidase-like 2 

tRNA isopentenylpyrophosphate transferas 
ATPase, H transporting, lysosomal (vacuo 
gb:Human mariner! transposase gene, comp 
cytochrome P450, subfamily IIB (pheriobar 
ESTs, Weakly similar to S65824 reverse t 
CDC2-related protein kinase 7 
arachidonate 15-)ipoxygenase. second typ 
6-factor, properdin 
ESTs 

sialyltransferase 

retinol dehydrogenase 5 (1 1-cis and 9-d 
Homo sapiens ctone IMAGE:451939, mRNA se 
ESTs 

aminopep&dase 

Homo sapiens ctone 24628 mRNA sequence 
Homo sapiens PR02751 mRNA. complete cds 
cysteine dioxygenase. type I 
Target 

Homo sapiens mRNA fuU length insert cDN 
ESTs 

olfactory receptor, family 2. subfamily 
carbohydrate (keratan sulfate Ga[-6) sul 
proteasome (prosome, macropain) 26S subu 
Melanoma associated gene 
gb:EST29171 Cerebellum 11 Homo sapiens c 
gb:wg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S 
Homo sapiens cDNA FUl 1309 fis, done PL 
platelet-activating fiactor acetylhydrola 
Bloom syndrome 



SS.TM,7tm.1,rrm.SS 
TM,GDl.7tmJ. 

SS.TM.tfypsin,SS,TM,trefoil.trypsin,tref 
TM,pkinase.fn3, 
SS 

SS,abhydroIase, 
SSiproLisomerase. 
SS.TM.7tm_2,GPS 
SS,TM,trefoil,trypsin,trefoil 
SiS,rnn.zf-RanBP,pkinase,C2,pklnase.C.0AGESTs 



ESTs 

0-linked N-acetylglucosamine (GlcNAc) tr 
TONDU 

Homo sapiens cDNA FLJ 10071 fis. clone HE 
Transmembrane protease, serine 3 
TEK tyrosine kinase, endothelial (venous 
rabS GTPase activating protein (GAP and 
kynureninase (L-kynurenine hydrolase) 
protein legulator of cytokinesis 1 
natuial klller-tunror recognition sequenc 
ESTs 

Transmembrane protease, serine 3 



3.4 

3.4 

3.4 

3.4 

3.4 

3.4 

3.4 

3.4- 

3.4 

3.4 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3.3 

3^ 

3.3 

3.3 . 
22 
3.3 
3.3 

3;3 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.1 

glycerol-3- 

3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3,1 
3.1 
3.1 
3.1 
3.1 
3.0 
3.0 
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TABLE 21 A 

Table 21 A shows the accession numbers for those pkeys lacking unigenelD's for Table 21. 
For each probeset, we have Usted the gene cluster number from which the oUgonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" colunin. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 



420854 197072J AW296927 AI684514 A1263168 AA281079- 

20 . 423431 228162^1 AA326062 AA32575B AW962182 

423945 233566_1 AA410943 AW948953 AA334202 AA332882 

451264 863988_1 A176 8235 R3 1400 H29082 H231 07 

455325 1 279475_1 AW89571 9 N31451 N41451 

456207 165078 -1 AA193450 
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TABLE 21B 

Table 21B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 21. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



20 



Pkey: Unique number oorrespond ing to an Eos pnobeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbanic Identifier (Gl) numbers. *Dunbam 1. et a)." refers to the publication 

entitled The DNA sequence of human chromosome 22 * Dunham I. et al.. Nature (1999) 402:469-495. 
strand: Indicates DMA strand from which exons were predicted. 

NLposlUon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


Ntj3osition 


401045 


8117619 


Plus 


90044-90184.91111-91345 


402230 


9966312 


Minus 


29782-29932 


402408 


9796239 


Minus 


110326-110491 


402578 


9884928 


Plus 


66350-66496 


403593 


6862650 


Minus 


62554-62712,69449-69602 


403943 


7711864 


Plus 


100742-100904.101322-101503 


404091 


7684554 


Minus 


82121-83229 



BNSDOCfO£ <WOJ .Q205Sa77A2^li.> 
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TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the 
denominator was the 85^** percentile value for 12 non-malignant breast specimens, and the 
96^'' percentile value amongst the 73 breast cancers was greater than or equal 100 units. 



Pkey. Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank acce^ion number 

UnigenetD: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of S0<» percentile tumorto 85^ percentile nomial breast tissue 



Pkey ExAccn UnigenelD UnigeneTHIe 



R1 



400292 

424735 

400297 

431448 

451110 

431211 

418203 

407980 

414646 

446921 

409041 

412140 

407824 

453160 

407137 

425692 

438533 

428227 

444342 

422505 

430515 

417308 

452744 

412446 

415539 

435496 

438209 

400205 

430965 

415263 

451952 

449722 

406685 

406690 

429925 

416498 

432378 

441377 

456207 

422805 

407811 

407178 



AA250737 Hs.72472 
U31875 Hs.272499 



A1127076 Hs.334473 

AL137517 Hs.334473 

AI955040 Hs.265398 

M86849 Hs.323733 

X54942 Hs.83758 
AA046309 

AA353776 Hs.901 

AB012113 Hs.16530 

AB033025 Hs.50081 

AA219691 Hs.73625 

AA147884 Hs.9812 

AI263307 Hs.239884 
T97307 

D90041 Hs.1 55956 

AI440266 Hs.170673 

AA321649 Hs.2248 
NM_014398Hs.10887 

AL120862 Hs.124165 

AA746503 Hs^83313 

H60720 Hs.81892 



BMP-RIB 51.5 

short-chain alcohol dehydrogenase family 38.3 

hypothetical protein DKFZp56401278 29.9 

hypothetical protein DKF2p56401278 26.9 

ESTs. WeaWy slmMr to transformation-r 25.8 

gap Junction protein, beta 2, 26kD (conn 23,2 

CDC28 protein kinase 2 22.6 
gb:zf12f01.s1 Scares fetaLheart_NbHH19W 19.8 



AI267652 
AI768015 
A1733881 



Hs.30504 
Hs.92127 
Hs.72472 



AW840171 Hs.265398 
AL1 20559 Hs.6111 
NA 

AA489732 Hs.154918 
AA948033 Hs.130853 
AL120173 Hs.301663 
BE280074 Hs,23960 
M18728 

M29540 Hs.220529 
NM_000786Hs.226213 
U33632 Hs.79351 
AI493046 Hs.146133 
BE218239 Hs.202656 
AA193450 

AA436989 Hs.121017 
AW190902 Hs.40098 
AA195651 Hs.104106 



CD48 antigen (B-cell membrane protein) 18.9 

small inducible cytokine subfamily A (Cy 1 8.0 

KIAA1 199 protein 17.6 

RAB6 interacting, kinesin-like (rabkines 17.6 

Homo sapiens cDNA FLJ1 4388 fis. done HE 17.1 

H2B histone family, member L 17.0 

gb:ye53h05.s1 Soares fetal liver spleen 16.1 

N^oelyttransferase 1 (aiylamine N-acety 16.1 

ESTs. Weakly similar to T24832 hypotheti 16.0 

small inducible cytokine subfamily B (CX 1 5.5 

similar to tysosonn&'associated membrane 15.1 

programmed cell death 9 (PDCD9) 14.9 

ESTs 14.7 

KIAA0101 gene product 14.4 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr14.4 

ESTs 14.2 

BMP-RIB 14.1 

ESTs, Weakly similar to transfomiation-r 13.8 

aryl-hydrocaibon receptor nuclear transi 1 3.8 
NM_006265*:Homo sapiens RA021 (S. pombe)13.5 

ESTs 13.4 

ESTs 13.3 

ESTs 13.2 

cydin B1 ' . . 13.2 

gbiHuman nonspecific crossreacling anlig 1 3.0 

cardnoembryonic antigen-related cell ad 1 2.8 

cytochrome P450. 51 (lanosterol 14-aIpha 1 2.8 

potassium channel, subfamily K. member 1 12.7 

ESTs 12.5 

ESTs 125 
gb2r40e07.r1 Soares_NhHMPu_S1 Homosapi12.4 

H2A histone faml^. member A 12.2 

cysteine knotsuperfamily 1, BMP antagon 12.2 

ESTs 12^ 
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420931 
421727 
434408 
446591 
431385 
443348 
416602 
433365 
437866 
412472 
416030 
439979 
420757 
411598 
423600 
430770 
421037 
452461 
409269 
417791 
447268 
424001 
447342 
424905 
453619 
442942 
434377 
427217 
445730 
432887 
452243 
424590 
432169 
438950 
418836 
430291 
444665 
407377 
445413 
443462 
442145 
435570 
439820 
428966 
449448 
433929 
432731 
411815 
415385 
422026 
432596 
439451 
423945 
442432 
446715 
408771 
437021 
428479 
428839 
402408 
418601 
426327 
419519 
440821 
446142 
418198 
447178 



AF044197 Hs.100431 small Inducible cytokine B subfamUy (Cy 12.1 

Y13153 Hs.107318 kynurenlne3-fnonooxygenase(kynurenine3 12.0 

AI031771 Hs,132586 ESTs 12,0 

H44185 Hs.15456 PD2 domain containing 1 11.9 

BE178536 Hs.11090 membrane-spanning 4-domains. subfamily A 118 

calmodurin 2 (phosphorylase kinase, deit 1 1 7 

nel (chickenHike 2 11.7 

ESTs 11.6 

metailothtonein 1 E (fundionaQ 1 1 .5 

ESTs 11.4 

ESTs 11.3 

hypoflielical protein FU10430 11.3 

androgen receptor (dlhydrotesiosteTone r 11 .3 

H3 histone family, member A 1 1 .2 

AI633559 Hs.310359 ESTs 11.2 

AA765694 Hs.123296 ESTs 11.0 

AI684808 Hs.1 97653 programmed cell death 9 (PDCD9) 10.9 

N78223 Hs.108106 transcription factor 10,7 

AA576953 Hs.22972 hypothetical protein FU 13352 10.6 

ESTs 10.6 

hypothetical protein FU2241 8 1 0.4 

patemally expressed 10 10.4 

Homo sapiens, Similar to RIKEN cDNA 2010 10.4 

MIMA (never In mitosis gene a)-re!ated k 10.1 
Homo sapiens, done MGC:9084. mRNA. complO.1 

ESTs 10,1 

Homo sapiens cDNAFUl 1382 fis, done HE 10.1 

ESTs 10.1 

ESTs 10.0 

ESTs 10.0 

programmed cell death 9 9.9 

hypothetical protein FU20086 9.9 

phosphoribosyt pyrophosphate synthetase 9.9 

EST 9.9 

ESTs 9.8 

AV660345 Hs.238126 CGI^9 protein 9.8 

BE613126 Hs.47783 B aggressive lymphoma gene 9.7 
C16391 gb:C16391 Clontech human aorta polyA mRN 9.7 

AA151342 Hs.12677 CGI-147 protein 9.7 

Aiq64690 Hs.171176 ESTs 9.7 

AI022650 Hs.8117 ert)b2Hnteracting protein ERBIN 9.7 

AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7 

AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6 

AF059214 Hs.194687 cholesterol 25-hydroxy!ase 9.6 

D60730 Hs.57471 ESTs 9.6 

AI375499 Hs.27379 ESTs • 9.5 

R31178 Hs.2a7820 fibronectin 1 9.3 

AA1 56679 Hs.1 25790 leudne-ridi repeat-containing 2 9.3 

R17798 Hs.7535 COBW-Uke protein . 9.3 

U80736 Hs.110826 trinudeotide repeat containing 9 9.2 

AJ224741 Hs.278461 matriGn3 9.2 

AF086270 Hs.278554 heterodiromatin-like protein 1 9.2 

AA410943 gb2t32h03,n Soares ovary tumor NbHOT H 9.1 

BE093589 Hs.38178 hypothetical protein FU23468 9.1 
AI337735 Hs.173919 ESTs. Moderately simUar to ZN91_HUMAN Z 9.0 

potassium vottege^ated channel, delayed 9.0 

ESTs 9.0 

cell division cyde 2. G 1 to S and G2 to 8.9 

Homo sapiens cDNAFU14814fis, done NT 8.9 

NiVL030920*:Homo sapiens hypothetical pro 8.9 

calmegin 8.8 
Homo sapiens done TCCCTA00151 mRNA sequ8.8 

ESTs 8.8 

ESTs 8.8 

ESTs 8.8 

KiAA1708 protein 8.7 

ESTs 8.7 



AW873596 Hs.182278 
NM^00615gHs.79389 
AF026944 Hs.293797 
AA1 56781 Hs.74170 
AW975398 Hs.293836 
H15261 Hs.21948 
AW600291 Hs.6823 
X78592 Hs.99915 
BE336654 Hs,70937 



AW965339 Hs.111471 
AI370413 Hs.36563 
W67883 Hs.137476 
AI199268 Hs.19322 
NM_002497Hs.153704 
H87648 Hs.33922 
AW167087 Hs.131562 
AW137148 Hs.306593 
AA399272 Hs.144341 
AI624342 Hs,170042 
A1926047 Hs.162859 
AL355715 Hs.28555 
AW966399 Hs.46821 
Y00971 Hs.2910 
H23789 Hs.144530 
AI655499 Hs.161712 



AW732573 Hs.47584 
AI076089 Hs.292239 



Hs. 184572 
Hs,82302 

Hs.86368 
Hs.44898 
Hs.176376 
AW296Q24 Hs.150434 
AI754693 Hs.145968 
A1745649 Hs.26549 
AW594641 Hs.192417 



Y00272 

A1767756 

NA 

AA279490 

W03242 

AI198719 
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427585 D31152 Hs. 179729 collagen, type X. alpha 1 (Schmid metaph 8.6 
415857 AA866115 Hs.127797 Homo sapiens cDNAFUl 1381 fs, done HE 8.6 



435061 A1651474 Hs.163944 ESTs 8.6 

431374 BE258532 Hs.251871 CTP synthase '8.4 

5 417866 AW067903 Hs.82772 collagen, type XI. alpha 1 8.4 

437211 AA382207 Hs.5509 ecolfopic viral integration site 2B 8.3 
437751 AA767373 Hs.35669 ESTs. Moderately similar to ALU1_HUMAN A 8.3 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2 

440941 BE268362 Hs.7535 COBW-Iike protein .8.2 

10 429859 NM_007050Hs,225952 protein tyrosine phosphatase, receptor t 8.2 

410193 AJ132592 Hs.59757 zinc finger protein 281 8.2 

431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1 

446258 AI283476 Hs.263478 ESTs 8.1 

416747 AW876523 Hs.15929 hypothetical protein FU12910 8.1 . 

15 434424 AI611202 Hs.325335 Homo sapiens cDNA: FU23523 fe. ctone L 8.1 

421650 AA781795 Hs.122587 ESTs 8.0 

429534 AW976987 Hs.163327 ESTs. Weakly similar to 21 09260ABceD 8.0 

457465 AW301344 85.122908 DNA replication fector 8.0 

427961 AW293165 Hs.143134 ESTs 8.0 

20 436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquilin-con 8.0 

418216 AA662240 Hs.283099 API 5q 14 protein 8.0 
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9 

400285 NA Eos Control 7.9 

401464 AF039241 Hs.9028 histone deacetylase 5 7.9 

25 407242 gb:Human nonspecific crossreacting antig 7.8 

422232 D43945 Hs.113274 transcription factor EC 7.8 

454024 AA993527 Hs,293907 hypothetical protein FU23403 7.8 

444542 AI161293 Hs.280380 aminopept'dase 7.8 

436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7 

30 437204 AL110216 Hs.12285 ESTs, Weakiy simiiar to 155214 salivaiy 7.6 

408805 H69912 Hs.48269 vaccinia related kinase 1 7.6 

437207 T27503 Hs.15929 hypothetical protein FU 12910 7.6 

442818 AK001741 Hs.8739 hypothetical protein FLJ1 0879 7.6 

426283 NM_003937Hs.169139 kynureninase (L-kynurenine hydrolase) 7,5 

* 35 424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelalinase B 7.5 

446315 NM^016293Hs.14770 bridging integrator 2 7.5 

433426 H69125 Hs.133525 ESTs 7.5 

406639 1^97711 gb:Human T-cell receptor (V beta 18.1, J 7.5 

420077 AW512260 Hs.87767 ESTs 7.4 

40 457332 AA961694 Hs.105187 kinesin protein 9 gene 7.4 

422938 NM.001809Hs.1594 centromere protein A (17kD) 7.4 
447555 A1391662 Hs.160963 Homo sapiens, ctone M6C:1231 8. mRNA.com7.4 
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE 11, ELOr4GATI07.3 

410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3 
45 400268 NA NM„003292:Homo sapiens translocated prom 7.3 

439509 AF086332 Hs.58314 ESTs 7,3 

407771 AL138272 Hs.62713 ESTs 7.3 

407202 N58172 Hs.109370 ESTs 7.3 

433096 AU076803 Hs.282975 carboxylesterase 2 (Intesfine, liver) 7.2 

50 422094 AF129535 Hs.272027 F-box only protein 5 * 7.1 

430832 AI073913 Hs.100686 ESTs. Weakly similar to JE0350 Anterior 7.1 
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5„HUMAN LEUKE7.0 

423739 AA398155 Hs.97600 ESTs 7.0 ' 
448212 AI475858 gb:tc87d07.x1 NCLCGAP_CLL1 Horro sapiens7.0 
55 407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0 
454440 BE062906 Hs.28338 KIAA1 546 protein 7,0 
444783 AK001468 Hs.62180 anillin (Drosophila Soaps homotog). act 7.0 
421373 AA808229 Hs.167771 ESTs 6.9 
431960 AW241821 Hs.301927 c6.1A 6.9 
60 424704 AI263293 Hs.152096 cytochrome P450. subfamily lU (arachido 6.8 
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8 
439840 AW449211 Hs.1 05445 GDNF family receptor alpha 1 6.8 
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8 
441243 AI767056 Hs.193002 ESTs 6.7 
65 408380 AF123050 Hs.44532 diubtquitin 6.7 
422956 BE545072 Hs.122579 hypotheUcal protein FU 10461 6.7 
446651 AA393907 Hs.97179 ESTs 6.7 



323 



icnnr»r> <.wrt 



wo 02/059377 PCT/US02/02242 



419839 U24577 Hs.93304 ■ phospholipase A2, group VII (platelet-ac 6.7 

437740 M810265 Hs.l22915 ESTs 6.7 

421582 A(910275 Hs.1406 trefoil factor 1 (pS2) 6.7 

427356 AW023482 Hs,97849 ESTs 6.6 

5 429597 NM.003816Hs.2442 a disintegrin and mefalioproteinase doma 6.6 

422634 NM.016010HS.1 18821 CGW2 protein 6.6 

421072 AI215069 Hs.89113 ESTs 6.5 

427718 AI798680 Hs.25933 ESTs 6.5 

411000 N40449 Hs.201619 ESTs. Weakly similar to S38383 SEB4B pro 6.5 

10 449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B)/cat 6.4 

409757 NMJ01898HS.123114 cystatinSN 6.4 

447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA. partial sequence 6.4 

458938 X52509 Hs.161640 tyrosine aminotransferase 6.4 

418848 A!820961 Hs.193465 ESTs 6.4 

15 424902 NM_003866Hs.1 53687 inositol polyphosphate-4-phosphatase, ty 6.4 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4 

439452 AA918317 Hs.57987 8-ce«CLL/lymphoma 118 (zinc finger pro 6.4 

407266 AJ235664 gbiHomo sapiens mRNA for immunoglobulin 6.3 

411078 AI222020 Hs.182364 CocoaCrisp 6.3 

20 433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 6.3 

434340 AI193043 Hs.128685 ESTs. Weakly similar to T17226 hypotheti .6.2 

429503 AA394183 Hs.26873 ESTs 6.2 

402578 C1001134:g{|2117372|pir||l65981 fettyac 6.2 

409646 AW161391 Hs.709 deoxycytidine kinase 6.1 

25 430447 W17064 H5.332848 SWI/SNF related, matrix assodated. acti 6.1 

432415 T16971 Hs.289014 ESTs. Weakly simflar to A43932 mucin 2 p 6.1 

443709 AI082692 Hs.134662 ESTs 6.1 

420929 AI694143 Hs.296251 programmed cell death 4 6.1 

428368 BE440042 Hs.d3326 matrix metalloproteinase 3 (stromelysin 6.1 

30 428248 AI126772 Hs.40479 ESTs 6.0 

420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0 

453392 U23752 Hs.32964 SRY (sex detennining region Y)-box 1 1 . 6.0 

425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.0 

418007 M13509 . Hs,83169 matrix metalloproteinase 1 (MMPI; Inters 6.0 

3 5 428585 AB007863 Hs.185140 K1AA0403 protein 6.0 

437608 AA761605 Hs.292308 ESTs. Weakly similar to ALU1_HUMAN ALU S6.0 

427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0 

406687 M31126 Hs,272620 matrix metalloproteinase 1 1 (MMP1 1; slro 6,0 

418092 R45154 Hs.106604 ESTs 6.0 

40 447051 AW139130 Hs.160951 ESTs. Weakly simflar to Coni [H.sapiens] 6.0 

441233 AA972965 Hs.135568 ESTs 6.0 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (coilagenase 6.0 

435106 AA100847 Hs.193380 ESTs. Highly similar to AF174600 1 F-box 5.9 

435525 AI831297 Hs.123310 ESTs 5.9 

45 458809 AW97251 2 Hs.20985 sin3-associated polypeptide. 30kD 5.9 

410785 AW803341 gb:IL2-UM0079-090300-<}50.D03 UM0079 Homo5.9 

422576 BE548555 Hs.118554 CGl-83 protein 5.9 

451398 AI793124 H$.144479 ESTs 5.9 

441881 AWg68904 Hs.179566 hypothetical protein FU22624 5.8 

50 412022 AI005043 Hs.24143 Wiskott-Akirich syndrome protein Interac 5.8 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxy lie 5.8 

447350 AI375572 Hs.172634 ESTs 5.8 

434094 AA305599 Hs.238205 hypothetical protein PRQ2013 5.8 

409151 AA306105 Hs.50785 SEC22. vesk:l8 trafTicking protein (S. c 5.8 

55 448807 A1571940 Hs.7549 ESTs 5.8 

452281 T93500 Hs.28792 Homo sapiens cDNAFUl 1041 fis. done PL 5.8 

421281 AI299139 Hs,17517 ESTs 5.8 

430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-like 5.8 

400289 X07820 Hs.2258 matrix metaltoproteinase 10 (MMPIOistr 5.7 

60 440527 AV657117 Hs.184164 ESTs, Moderately sinular to S65657 alpha 5.7 

434674 AA831879 Hs.136985 ESTs 5.7 

426320 W47595 Hs.169300 transfomning growth factor, beta 2 5.7 

452401 NM_007115Hs.29352 tumorneoosis factor, alpha-induced pro 5.7 

448663 BE614599 Hs.106823 hypothetk^al protein M6C14797 5.7 

65 438199 AW016531 H$.122147 ESTs 5.7 

446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7 

428336 AA503115 Hs.163752 microseminoprotein. beta- 5.6 
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430379 

422835 

444758 

443425 

400301 

447078 

432015 

438691 

439809 

415786 

456373 

401645 

437967 

445885 

439138 

440270 

437536 

438167 

452741 

426214 

413554 

422867 

434263 

446382 

422406 

438321 

418310 

419625 

450701 

445900 

449051 

418758 

431070 

417079 

421928 

428804 

427427 

403485 

422168 

421937 

426752 

453310 

423198 

412281 

447513 

453931 

404347 

431808 

429113 

436291 

450603 

434725 

435981 

407376 

431689 

405348 

436196 

437065 

410196 

429412 

446619 

403329 

442875 

442441 

430375 

424128 

408873 



AF134149 Hs.240395 
BE218705 Hs.121378 
AL044878 Hs.11899 
AF098158 Hs.9329 
X03635 Hs.1657 
AW885727 Hs.301570 
AL157504 Hs.159115 
AA906288 Hs.212184 
R41396 Hs.101774 
AW419196 Hs.257924 
BE247706 Hs.89751 
NA 

BE277414 Hs.5947 
AI734009 Hs.127699 
AI742605 Hs.193696 
NM_015986Hs7120 
X91221 Hs.144465 
R28363 Hs.24286 
BE392914 Hs.30503 
H59846 Hs.1 28355 
AA319146 Hs.75426 



L32137 
N34895 



Hs.1584 
Hs.44648 



AW205168 Hs.150823 
AF025441 Hs.1 16206 
AA576635 Hs.6153 
AA814100 Hs.86693 
U91616 Hs.91640 
H39960 Hs.288467 
AF070526 Hs.13429 
AW961400 Hs.333526 
AW959311 Hs.172012 
AW40d164 Hs.249184 
U65590 Hs.81134 
AF013758 Hs,109643 
AKD00713 Hs.193736 
AF077345 Hs.177936 



AA586894 

AI878857 

X69490 

X70697 

M81933 

A)810054 

AW955776 

AL121278 

M30703 
028235 



Hs.1 12408 

Hs.109706 

Hs.172004 

Hs.553 

Hs.1634 

Hs.14119 

Hs.313500 

Hs.25144 

Hs.270833 
Hs.195384 



BE568452 Hs,5101 
R43646 Hs.12422 
AK000796 Hs,4104 
H74319 Hs.188620 
AA993138 Hs.142287 
AA305688 Hs.267695 
NA 

AK001084 Hs.333498 
AL036450 Hs.103238 
AI936442 Hs.59838 
NM_006235Hs.2407 
AU076643 Hs.313 
NA 

BE623003 Hs.23625 
At820662 Hs.129598 
AW371048 Hs.93758 
AW966163 

AL046017 Hs.182278 



potassium channel, subfamily K, member 6 5.6 - 
metallothionein-like 5, testis-specific 5.6 
3-hydroxy-3-methyfgJutary!-Coenzyme A re 5.6 
chromosome 20 open reading frame 1 5.6 
estrogen receptor 1 5.6 
ESTs 5.6 
Homo sapiens mRNA; cDNA OKFZp586O0724 (S.5 
ESTs 5.5 
hypothetical protein FU2304S 5.5 
hypothetical protein FLJ13782 5.5 
membrane-spanning 4-domain5, subfamily A 5.5 
C16001440*:gi|12330704|gblAAG52890.11AF35.5 . 
mel transfonning oncogene (derived from 5.5 
KIAA1603 protein 5.4 
ESTs 5.4 
cytokine receptor-like molecule 9 5.4 
ESTs 5.4 
ESTs 5.4 
Homo sapiens cDNA FLJ1 1344 fis. clone PL 5.4 
ESTs. Moderately similar to ALU7_HUMAN A 5.4 
secrelogranin 11 (chromogranin C] 5.4 
cartilage oltgomeric matrix protein (COM 5.4 
ESTs 5.4 
ESTs 5.4 
Opa-interacting protein 5 5.3 
CGl-48 protein 5.3 
ESTs 5.3 
rvjr. ; . ' factor of kappa light poiypeptid 5.3 
r - ■apienscDNAFLJ12280fiS,cloneMA 5.3 
apiens clone 24787 mRNA sequence 5.2 
r r: receptor tyrosine kinase (c-erl>-b2. 5.2 
hypoiheticai protein OKFZp434 J037 5.2 
transcription factor 1 9 (SCI ) 5.2 
intedeukin 1 receptor antagonist 5.2 
polyadenylate binding protein-inteFactin 5.2 
hypothetical protein FU20706 5.2 
ESTs 5.2 
C3001B13*:gi|12737279lref|XP^012163.1l k 5.2 
S100 caicium-binding. protein A7 (psorias 6. 1 
hematological and neurological expressed 5. 1 
titin 5.1 
solute carrier family 6 (neurotransnutte 5.1 
ceil division cyde 25A 5. 1 

ESTs 5.1 
ESTs. Moderatelysimilarto ALU7^HUMAN A 5.1 
ESTs 5.1 
Target Exon 5.1 
amphiregulin (schwannoma-derived growth 5.1 
prostagtandln-endoperoxtde synthase 2 (p 5.1 
protein regulator of cytokinesis 1 5.1 
ESTs 5.1 
hypothetical protein 5.0 
ESTs 5.0 
ESTs. Weakly similar to ALUF^HUMAN !!!! 5.0 
UDP-Gal:b8taGlcNAc beta 1.3-ga!actosyltr 5.0 
C7001664:gi|12698061|dbjlBAB21849.1| (AB 5.0 
Homo sapiens cDNAFU10222fis, done HE 5.0 
ESTs 5!0 
hypothetical protein F1J10808 5.0 
POU domain^ class 2, assodating factor 5.0 
secreted phosphoprotein 1 (osteopontin. 4.9 
Target Exon 4.9 
Homo sapiens done TCCCTA00142 mRNA sequ4.9 
ESTs 4.9 
H4 histone family, member H 4.9 
gb:EST378236 MAGE resequences, MAGI Homo45 
calmodulin 2 (phosphorylase kinase, delt 4.9 
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407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9 

432606 NM_002104Hs.3066 granzyme K (serine protease, granzyme 3; 4.9 

453204 R10799 Hs.191990 ESTs 4.8 

452020 AA722012 Hs.255757 ESTs. Weakly similar to AT2A HUMAN POTEN4.8 

5 449048 Z45051 Hs.22920 similar to S68401 (cattie) glucose indue 4.8 

408369 R38438 Hs.182575 solute carrier family 15 (H??? transport 4.8 

431645 AF078849 Hs.266483 dynein liglit chain-A 4.8 

423575 C18863 Hs.163443 Homo sapiens cDNAFUl 1576 fis. done HE 4.8 

444246 H93281 Hs.10710 hypothetical protein FU204 17 4.8 

10 421524 AA312082 Hs.105445 GDNF famUy receptor alpha 1 4.8 

452827 AI571835 Hs.55468 ESTs 4.8 . 

414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8 

456086 AL1 61999 Hs77324 eukaryotic translation temnination facto 4.8 

419078 M93119 Hs.89584 insulinonra-assodated 1 4.8 

15 418973 AA233056 Hs.191518 ESTs 4.8 

447033 AI357412 Hs.157601 ESTs 4.8 

451621 AI879148 Hs.26770 fatty add binding protein 7. brain 4,7 

419968 X04430 Hs.93913 interieukin 6 (interferon, bete 2) 4.7 

424325 NM_014479Hs.145296 disintegrin protease 4.7 

20 431585 BE242803 Hs.262823 hypothetical protein FU 10326 4.7 

429294 AA095971 Hs.19d793 Homo sapiens cDNA: FLJ22463 Hs. clone H 4.7 

416814 AW192307 Hs.80042 dolichyl4'-Glc:Man9GlcNAc2-PP-dofichylgl 4.7 

439897 NM_015310Hs.6783 KtAA0942 protein 4.7 

429687 A1675749 Hs.211608 nucteoporin153kD 4.7 

2S 422880 AF226704 Hs.121524 glutafhkine reductase 4.7 
405801 NM.000390:Homo sapiens choroideremia(Ra4.6 

432435 BE218686 Hs.282070 ESTs 4.6 

439544 W26354 Hs.28891 hypothetical protein FU 11 360; artemisp 4.6 

425354 U62027 Hs.155935 complement component 3a receptor 1 4.6 

30 436027 A1864053 Hs.39g72 ESTs. Weakly similar to 138588 reverse t 4.6 

424623 AW963062 Hs.337404 ESTs 4.6 

403366 NA Target Exon 4.6 

402542 Target Exon 4.6 

450193 A1916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6 

35 411678 AI907114 Hs.71465 squalene epoxidase 4.6 

456844 Ai264155 Hs.1 52981 CDP-diacylglycerol synthase (phosphatida 4.6 

448072 AI459306 Hs.24908 ESTs 4.5 

408045 AW138959 Hs.245123 ESTs 4.5 

423782 AI472209 Hs.323117 ESTs 4.5 

40 447388 AW630534 Hs.76277 Homo sapiens, clone MGC:9381, mRNA, comp4.5 

448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5 

452561 AI692181 Hs.49169 KIAA1 634 protein 4.5 

425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo4.5 

428801 AW277121 Hs.254881 ESTs AJ5 

45 428500 AI815395 Hs.184641 fatty add desatufase 2 4.5 

426075 AW513691 Hs.270149 ESTs, Weakly simitar to 2109260AB cell 4.4 

437259 AI377755 Hs.120695 ESTs 4.4 

400409 AF153341 Hs.283954 Homo sapiens winged helix/foridiead trans 4.4 

412863 AA121673 Hs.59757 zinc finger protein 281 4.4 

50 426989 AI815206 Hs.993g5 ESTs 4.4 

401866 Target Exon 4.4 

418819 AA228776 Hs.191721 ESTs 4.4 

406348 Target Exon 4.4 

412138 AW895387 gb:QV4-NN0038-300300-157^10 NN0038 Homo4.4 

55 428550 AW297880 Hs.9866i ESTs 4.4 

411743 AW862214 gb:QV4-CT0361-301299-074-b05CT0361 Homo4.4 

429966 BE081342 Hs.283037 HSPC039 protein 4.4 

423291 NM_004129Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4 

423456 AL1 10151 Hs.128797 DKF2P586D0824 protein 4.4 

60 452190 H26735 Hs.91668 Homo sapiens done PP1 498 unknown mRNA 4.4 

424871 NM_Q04525Hs.1 53595 low density tipoprotein-related protein 4.3 

429575 AA706003 Hs.99387 ESTs 4.3 

429922 Z97630 Hs.226117 HI histone family, member 0 4.3 

421379 Y15221 Hs.103982 smafl inducible cytokine subfamily B (Cy 4.3 

65 400300 X03363 HER2 receptor tyrosine kinase (<>ert}-b2. 4,3 

437258 AL041243 Hs.174104 ESTs 4.3 

446595 T57448 Hs. 15467 hypothetical protein FU20725 4.3 
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403011 NA ENSP00000215330*:PrDbableserine/threoni 4.3 

419055 A1365384 Hs.11571 Homo sapiens cDNAFU11570fis. done HE 4.3 

418661 Nf^00ig49Hs.1189 E2F transcription factor 3 4.3 

407785 AA687538 HS.3&972 tetraspan 1 4.3 

5 429183 AB014604 Hs.197955 KIAA0704 protein 4.3 

442914 AW1d8551 Hs.99519 hypothetical protein FU 14007 4.3 

441029 AI091795 Hs.179246 ESTs 4.3 

452194 AI694413 Hs.332649 olfactory receptor, family 2. subfanuly 4.3 

414821 M63835 Hs.77424 FcfragmentoftgG, high affinity la, re 4.2 

10 410102 AW246508 Hs.279727 Homo sapiens cONA FU14035 fis, done HE 4.2 

452110 T47667 Hs,28005 Homo sapiens cDNAFU11309fis. done PL 4.2 

442007 AA301116 Hs.142838 nucleolar phosphoprotetn Nop|)34 4.2 

417318 AW953937 Hs.12891 ESTs 4.2 

431818 AW510444 Hs.191705 ESTs. Weakly similar to T471 84 hypothefi 4.2 

15 443646 AI085198 Hs.164226 ESTs 4.2 

419169 AW851980 Hs,262346 ESTs, Weakly similarto S72482 hypotheti 4.2 

446839 BE091926 Hs.16244 mitotic spindle c-Diled-coil related prot 4.2 

423242 AL039402 Hs.125783 DEME-6 proteir. 4.2 

432116 AA902953 Hs.308538 ESTs 4.2 

20 409038 T97490 Hs.50002 small inducible cyxokine subfamily A (Cy 4.2 

445625 BE246743 Hs.288529 hypothetical protein FLJ22635 A2 

425139 AW630486 Hs.325820 protease, serine. 23 4.2 

447397 BE247676 Hs,18442 E-1 enzyme 4.2 

410166 AK001376 Hs.59346 iiypothetical protein FU10514 4.1 

25 437295 AW779318 Hs.88417 ESTs 4.1 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1 

441790 AW294909 Hs.132208 ESTs 4.1 

410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1 

427521 AW973352 Hs.290585 ESTs 4.1 

30 425247 NIVL005940HS.155324 matrix metalkiproteinase 11 (MMP11:stro 4.1 

412886 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1 
441153 BE562826 gb:601336534F1 NIH_MGC^44 Homo sapiens c4.1 

444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1 

426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1 

35 405850 NA Target Exon 4.1 

440283 AI732892 Hs.190489 ESTs 4.0 . 

432441 AW292425 Hs.163484 ESTs 4.0 

400284 NA estrogen receptor 1 4.0 

417341 N91453 Hs.102987 ESTs 4.0 

40 429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4.0 

• 411393 AW797437 Hs.69771 B-factor, properdin 4.0 

425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0 
419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein . 4.0 

419092 J05581 Hs.89603 mucin 1, transmembrane 4.0 

45 443147 A1034351 Hs.19030 ESTs 4.0 

408633 AW963372 Hs.46677 PRO2000 protein 4.0 

433404 T32982 Hs.102720 ESTs 4.0 

421506 BE302796 Hs,105097 thymidine kinase 1 . soluble 4.0 

417900 BE250127 Hs.82906 CDC20 (cell division cyde 20, S. cerevi 3.9 

50 414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f3. 9 

413762 AW411479 Hs.848 FK5a6-binding protein 4 (59kD) 3.9 
404580 NM_014112*:Homo sapiens trichorhinophala 3.9 

452046 AB018345 Hs.27657 K1AA0802 protein 3.9 
459587 AA031956 gb:2k15e04.s1 Scares j)regnanLutenis^NbH 3.9 

55 416658 U03272 Hs.79432 fibrillin 2 (congenital contractural ara 3.9 

426647 AA243464 Hs.294101 pre-B-ceH leukemia transcription factor 3.9 

429353 AL117408 HS.2GC102 ATP-binding cassette transporter MRP8 3.9 

419038 AW134924 HS.1S0325 ESTs 3.9 

418918 X07871 Hs.89476 CD2 antigen (p50). sheep red biood ceU 3.9' 

60 421977 W94197 Hs. 11 0165 n'bosomal protein L26 homolog 3.9 

442567 AI201183 Hs.130251 ESTs 3.9 

421168 AF182277 Hs.330780 cytochrome P450. subfamily IIB (phenobar 3.9 
431701 AW935490 Hs. 14658 Human chromosome 5q13.1 done 5G8 mRNA3.9 

418526 BE019020 Hs.85838 solute carrier family 16 (monocaiboxylic 3,9 

65 414998 NM_002543Hs.77729 oxidised low density lipoprotein (lectin 3.9 

422790 AA809875 Hs.25933 ESTs 3.9 

419741 NM-007019HS.93002 ubiquitin earner protein E2-<^ 3.9 
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430017 AA263172 Hs.35 prote^ tyrosine phosphatase, non-recept 3.9 

458814 AI498957 Hs.170861 ESTs. Weakly similar to Z195.HUMAN ZINC 3.8 

428514 AW236861 Hs.193139 ESTs 3.8 

434521 NM_Q02267Hs.3888 karyropherin alpha 3 (importin alpha 4) 3.8 

5 409425 U40462 Hs.54452 anc finger protein, subfamily 1 A. 1 (Ik 3.8 

439560 BE565647 Hs74899 hypothetical protein FU 12820 3.8 

424028 AR)55084 Hs.153692 Homo sapiens cDNAFU14354fis. done Y7 3.8 

400021 AFFX control - HUMISGF3A/M97935_MA 3.8 

453403 BE486639 Hs.61779 Homo sapiens cONA FU 13591 fis. done PL 3,8 

10 445941 AI267371 Hs.172636 ESTs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 AI446444 Hs.1 90394 ESTs, Weakly similar to B28096 fine-l pr 3.8 

401045 C11001883*:gi|6763278|reqNP_033938.1|c 3.8 

15 430178 AW449612 Hs.152475 ESTs 3.8 

423397 . NM.001838Hs.1652 chemokine (C-C motiO receptor 7 3.8 

447630 AI660149 Hs.44865 lymphoid enhancer-binding factor 1 3.8 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 biglycan 3.8 

20 422121 AI767949 Hs.179833 ESTs 3.8 

452268 NM„003512Hs.28777 H2A histone family, member L 3.8 

427811 M81057 Hs.1 80884 carboxypeptkiase 81 (tissue) 3.8 

415579 AA165232 Hs.222069 ESTs 3.8 
437330 AL353944 Hs.50115 Homo sapiens mRNA;cDNA DKF2p761J1 112 (f 3.8 

25 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c^rb-b2, 37 

400286 NA C16000922:gi|7499103|pir||T20903hypothe 3.7. 

420281 AI623693 Hs.191533 ESTs 37 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7 

417541 AI992191 Hs.180040 hypothetical protein FU22439 37 

30 426172 AA371307 Hs.125056 ESTs 37 

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 37 

457001 J03258 Hs.2062 vitamin D (1 .25- dihydroxyvitamin D3) re 37 



424109 AW406878 gb:Ul-HF-BL0-adg^-06-0-U!.r1 NIH_MGC_,3737 



417022 NM_014737Hs.80905 Ras association (RalGDS/AF-6) domain fam 3.7 

35 435222 AI208737 Hs.l22810 Homo sapiens cONA FU1 1489 fis, done HE 3.7 

430448 AI633553 Hs.13303 Homo sapiens cDNA: FU21784 fis. done H 37 

432729 AK000292 Hs.278732 hypothetical protein FU20285 3.7 

413916 N49813 Hs.75615 apolipoprotein C-ll 37 

421662 NM_014141 Hs.106552 cell recognition molecule Caspr2 3.7 

40 441 633 AW958544 Hs. 1 12242 normal mucosa of esophagus spedfic 1 3.7 

408761 AA057264 Hs.238936 ESTs. Weakly similar to (defline not ava 37 

406153 Target Exon 37 

445563 AW873606 Hs.149006 ESTs 3.7 

453464 ■ AI88491 1 Hs.32989 receptor (calcitonin) activity modifying 3.7 

45 448918 AB011152 Hs.22572 KIAA0580 protein 3.7 

413936 AFl 13676 Hs297681 serine (or cysteine) proteinase inliibito 3.6 

448069 U76248 Hs.20191 seven in absentia (Drosbphila) homolog 2 3.6 

453313 6E005771 Hs.153746 hypothetical protein aJ22490 3.6 

425234 AW152225 Hs.165g09 ESTs, Weakly similar to 138022 hypotheti 3.6 

50 419941 X98654 Hs.93837 phosphatidyiinositol transfer protein, m 3.6 

402397 AF186625 Hs.189507 phospholfpaseA2. group NO 3.6 

430378 Z29572 Hs.2556 tumornecrosis factor receptor superfami 3.6 

448106 A1800470 Hs.171941 ESTs. 3.6 

426431 NM.000458HS.169853 transcription factor 2. hepatic; LF-B3: 3.6 

55 431843 AA516420 Hs.183526 ESTs, Weakly similar to 138022 hypotheti 3.6 
426878 BE069341 gb:QV3-BT0381 -270100^73^08 BT0381 Homo3.6 

434061 AW024973 Hs.283675 NPD009 protein 3.6 

445292 AV653264 Hs.13982 Homo sapiens cDNAFU14666rts, done NT 3.6 

452101 T60298 Hs.10844 Homo sapiens cONA FU 14476 fis. done MA 3.6 

60 427581 NN4_014788Hs.179703 KIAA0129 gene product 3.6 

409047 AW961434 Hs.31539 ESTs 3.6 

416820 NM^000402Hs.80206 glucose-6-phosphate dehydrogenase 3.6 

410386 W26187 Hs.3327 Homo sapiens cDNA: FU2221 9 fis, done H 3.6 

440516 S423D3 Hs-161 cadherln 2, type I.N-cadherin (neuronal 3.6 

65 434360 AW015415 Hs.127780 ESTs 3.6 

428970 BE276891 Hs.l 94691 retinote add induced 3 3.6 

415079 R43179 Hs.22895 hypothetical protein FU23548 3.6 
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409619 AK001015 Hs.55220 BCL2nassociated athanogene 2 . 3.6 

430044 AA464510 Hs.152812 ESTs 3.6 

430829 AW45ig99 Hs.1 94024 ESTs 3.6 

434224 AA380731 Hs.84 Interieukin 2 receptor, gamma (severe CO 3.6 

5 439247 AF08B020 Hs.46767 EST .r 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukafyoflcfranstattonetongatton factor 3.5 

434988 AI418055 Hs.1 61160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tumo/igenicit/ 7 3.5 

10 418322 AA284166 Hs.84113 cydin-dependent kinase inhibitor 3 (CDK 3.5 

439569 AW602166 Hs.222399 CEGP1 protein 3.5 
459583 AI907673 gb:iL-BT152-080399-004 BT152 Homo sapien3.5 

403212 NM_019595:Homo sapiens intersecGn 2 (IT 3.5 

409099 AK000725 Hs.50579 hypothetical piotern FU20718 3.5 

15 453968 AA847843 Hs.62711 Homo sapiens, done IMAGE:3351295. mRNA3.5 

435338 W92147 Hs.1 18394 ESTs 3.5 

422890 Z43784 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 3.5 

442295 AI827248 Hs.224398 Homo sapiens cON A FU 11 469 fis, done HE 3.5 

417975 AA641836 Hs.30085 hypothetical protein FU23186 3.5 

20 433730 AK002135 Hs.3542 hypothetical protein FLJ 11 273 3.5 

439926 AW014875 Hs.1 37007 ESTs 3.5 

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 3.4 

410153 BE311926 Hs.15830 hypothetical protein FU 12691 3.4 
422128 AW881 145 gb:QV0-OT0033-O1 0400-182-a07 OT0033 Homo 3.4 

25 414921 BE390551 Hs.77628 steroidogenic acute regulatoiy protein r 3.4 

441134 W29092 Hs.7678 cellular retinolc ack)-binding protein 1 3.4 

444564 A)167877 Hs.143716 ESTs 3.4 

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

30 422414 AW875237 Hs.1 3701 ESTs 3.4 " 

433345 AI681545 Hs.152982 hypothetical protein FU 131 17 3.4 

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4 

422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha(1.6)fucosy 3.4 

423554 M90516 Hs.1 674 glutamine-fnjctose^hosphatetransamin 3.4 

35 442619 AA447492 Hs.2ai83 ESTs. Weakly similar to AF164793 1 prote 3.4 
402359 NA C19001991^gi|12656111|gb|AAK00751.1|AF23.4 

439398 AA284267 Hs.221504 ESTs 3.4 

415208 F01020 Hs.172004 titin 3.4 

452853 AA812633 Hs.10845 ESTs 3.4 

40 429345 R11141 Hs.1 99695 hypothetical protein 3.4 

449027 AJ271216 Hs.22880 dipeptldylpeptidase III 3.4 

412115 AK001763 Hs.73239 hypothetical protein FU10901 3.4 

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4 

428977 AK001404 Hs.194698 cyciin B2 3.4 

45 431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4 
418286 AA622528 Hs.319825 Homo sapiens, done IMAGE:361 6574, mRNA.3.4 

436895 AF037335 Hs,5338 carbonic anhydrase XII (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4 . 

428450 NM_014791Hs.184339 KIAA0175 gene product. 3.3 

50 449571 AW016812 Hs.200266 ESTs 3.3 

412777 AI335773 Hs.270123 ESTs 3.3 

420542 NM_0005O5Hs.1321 coagulation factor Xll (Hageman factor) 3.3 

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-Oke protein 3.3 

418327 U70370 Hs,84136 paired-like homeodomain transcription fa 3.3 

55 449065 A1627393 Hs.258998 ESTs. Weakly similar to high mobiBtygr 3.3 ' 

425999 AW513051 Hs.332981 ESTs. Weakly similar to 138022 hypotheti 3.3 

430280 AA361258 Hs.237858 interieukin 7 receptor 3.3 

407777 AA161071 Hs.71465 squalene epoxidase 3.3 

426516 BE262660 Hs.170197 glutamkx»caloacefic transaminase 2, mit 3.3 

60 414361 AI086138 Hs.204044 ESTs 3.3 

427080 AW068287 Hs.173466 ras-related C3 botulinum toxin substrate 3.3 

426429 X73il4 Hs.169849 myosin-binding protein C, slow-type 3.3 

446163 AA026880 Hs.2S252 prolactin receptor 3.3 

428566 U41763 Hs.184916 dathrin. heavy polypeptide^ike 1 3.3 

65 418641 6E243136 Hs.d6947 a disintegrin and metalloproteinase doma 3.3 

436293 AI601188 Hs.12O910 ESTs 3.3 
411257 AA6289S7 Hs.115274 ESTs, Highly amilar to IHH.HUMAN INDIAN 3.3 
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430253 AK001514 Hs.236844 hypothetical protein FU10652 3.3 
430066 AI92g659 Hs.237825 signal recognition particle 72kD 3.3 
436469 AK001455 Hs.5198 Down syndrome critical region gene 2 33 
437788 BE142681 Hs.155573 polymerase (ONA dtiected). eta 3.3 
5 444079 H09048 Hs.23806 ESTs 3.3 
457183 H91882 Hs.1 18569 Dvl-binding protein IDAX (inhibition of 3.3 
431215 AA495078 Hs.121554 Human DMA sequence from done RP11-218C1 3.3 
424563 AA446932 Hs, 151 428 ret finger protein 2 3.3 
450828 AW270655 Hs,193804 ESTs 3.3 

10 408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1644 protein. 3.3 
445142 AW978484 Hs.93842 Homo sapiens cDNA: aJ22554 fis. clone H 3.3 
426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp58612Q22 (f3. 3 
439237 AW408158 Hs.318893 ESTs. Weakly simHar to A47582 B-celigr 3.3 
422616 BE300330 Hs.1 18725 selenophosphate synthetase 2 33 

15 443247 BE614387 Hs.333893 c-Myc target JP01 3.3 
406663 U24683 Hs.302063 immunoglobulin heavy constant mu 3.3 
434137 AA907734 Hs.124895 ESTs 3.3 
408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3 
439101 C01765 Hs.38750 hypothetical protein FU 11 526 3.3 

20 408221 AA912183 Hs.47447 ESTs 3.3 
447519 U46258 Hs.339665 ESTs 3.3 
404755 NA Target Exon 3.3 

451871 AI821005 Hs.1 18599 ESTs 3.2 
420319 AW406289 HS.96S93 hypothetical protein 3.2 

25 • 430580 AA806105 Hs.300697 immunoglobulin heavy constant gamma 3 (G 3.2 
400202 NA NI^002795^Homo sapiens proteasome(pios3.2 

400222 NA NM.002082*:Homo sapiens G protein^uple 3.2 

425988 BE045897 Hs.274454 ESTs, Weakly similar to 138022 hypotheti 3.2 
458098 BE550224 Hs.74170 metaltothionein IE (functional) 3.2 

30 430589 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 3.2 
431563 A1027643 Hs.120912 ESTs 3.2 
442353 BE379594 Hs.49136 ESTs. Moderately similar to ALU7_HUMAN A 3.2 
422309 U79745 Hs.114924 solute carrier family 16 (monocarboxy lie 3.2 
419703 A1793257 Hs.128151 ESTs 3.2 

35 420380 AA640891 Hs.102406 ESTs 3.2 
410853 H04588 Hs.30469 ESTs 3.2 
454417 AI244459 Hs.110826 trinucleotide repeat containing 9 3,2 
432745 AI821926 gb:nt78f05.x5NCLCGAP.Pr3 Homo sapiens 3.2 

422032 AA476966 Hs.1 10857 polymerase (RNA) 111 (DNA directed) poly 3.2 

40 415339 NM_015156Hs.78398 KIAA0071 protein 3.2 
426384 A1472078 Hs.303662 ESTs 3.2 
448030 N30714 Hs.325960 membrane-spanning 4-domalns. subfamily A 3.2 
418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2 
442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G163.2 

45 434747 AA837085 Hs.220585 ESTs 3.2 
427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333. mRNA, com3.2 
412228 AW503785 Hs.73792 complement component (3d/Epstein Ban* vi 3.2 
. 452304 AA025386 Hs.61311 ESTs, Weakly similar to SI 0590 cysteine 3.2 
453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2 

50 407758 D50915 Hs.38365 KIAAOI 25 gene product 3.2 
451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2 
430015 AW768399 Hs.112157 ESTs 3.2 
433313 W20128 Hs.29803g ESTs 3.2 
418334 AA319233 Hs.5521 ESTs 3.2 

55 450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2 
■454365 AW966728 Hs.54642 methionine adenosyltransferase II. beta 3.2 
451128 AL1 18668 gb:DKFZp761 1031 0_r1 761 (synonym: hamy2)3.2 

417793 AW405434 Hs.82575 small nuclear ribonudeoprotein polypept 3.2 
428027 U22029 Hs.334345 cytochrome P450. subfamily IIA (phenobar 3.2 

60 441197 BE244638 Hs,166 sterol regulatory element binding transc 3.2 
424634 Nf^_003613Hs.151407 cartilage intemf^ediate layer protein, nu 3.2 
419986 AI345455 Hs.78915 GA-binding protein transcription factor. 3.2 
416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-assodated 3.2 
449465 NM_004380Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2 

65 422166 W72424 Hs.1 12405 SI 00 caldum-binding protein A9 (calgran 3.2 
409079 W87707 Hs.82055 interleukin 6 signal transducer (gpl 30. 3.2 
423551 AA327598 Hs.233785 ESTs 3.2 
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453553 M036849 Hs.61829 Homo sapiens cDNAFU12763fis. clone NT 3.2 

442580 AI733682 Hs.1 30239 ESTs 3.2 
458079 Ai796870 Hs,54277 DNA segment on chromosome X (unique) 99232 

.425700 AF076292 Hs.159251 forkhead box H1 3.2 

5 417124 BE1 22762 Hs.25338 ESTs 3.2 

407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2 

442215 AI703172 Hs.129005 ESTs. Weakiy similar to 21 09260A B cell 3.1 

430271 T08199 Hs.237506 DnaJ (Hsp40) homolog, suWamily B, membe. 3.1 

425317 AW205118 Hs.210546 interieukin 21 receptor 3.1 

10 426095 AI278023 Hs.89986 ESTs 3.1 

442313 BE388B98 Hs.8215 hypothetical protein FU 11 307 3.1 

424709 AL137589 Hs.152149 hypothetical protein DKF2p434K0410 3.1 

429671 BE379335 Hs.211594 proteasome (prosome. macropafn) 268 subu 3.1 

432715 AA247152 Hs.200483 ESTs. Weakly similar to KIAA1074 protein 3.1 

15 431574 AW572659 Hs.261373 hypothetical protein dJ434014.3 3.1 

436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1 

405017 NA Target Exon 3.1 

433805 AA706910 Hs.1 12742 ESTs 3.1 

437352 Al^53957 Hs.284181 hypothetical protein DKFZp434P0531 3.1 

20 430105 X70297 Hs.2540 cholinergic receptor, nfcotinic, alpha p 3,1 

422083 NM_001 141 Hs.1 11256 arachidonate 15'lipoxy9enase, second typ 3.1 

413507 BE145360 Hs.190064 ESTs. Weakly similar to 138022 hypotheti 3,1 

415989 A1267700 Hs.317584 ESTs • 3.1 

422907 Ai879263 Hs.6986 Human glucose transporter pseudogene 3.1 

25 425548 AA890023 Hs.1906 prolactin receptor 3.1 

422599 BE387202 Hs.1 18638 non-melastatic ceils 1, protein (NM23A) 3.1 

439953 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1 

453883 AI63851 6 Hs.22630 cofactor required for Spl transcriptiona 3.1 

458021 AI885190 Hs.1 56089 ESTs, Weakly similar to repressor protei 3.1 

30 418478 U38945 Hs.1 174 cyclin-dependent kinase inhibitor 2A (me 3.1 

400814 NA Target Exon 3.1 

402327 Target Exon 3.1 

416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1 

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214. full 3.1 

3 5 437036 AI571514 Hs.133022 ESTs 3,1 

449523 NM_000579Hs.54443 chemokine (C-C motif) receptor 5 3.1 

406642 AJ245210 gb.Homo sapiens mRNA for immunoglobulin 3.1 

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunogb 3.1 

421924 BE514514 Hs.109606 coronin, ac&n-bmding protein. 1 A 3.1 

40 414523 AU076633 Hs,76353 serine (or cysteine) proteinase inhibito 3.1 

416379 N38857 Hs,203933 ESTs 3.1 

422823 D89974 Hs,121102 vanin2 3.1 

433904 AI399956 Hs.208956 ESTs 3.1 

421904 BE143533 Hs.109309 hypothetical protein FU20035 3.1 

45 428834 AW899713 Hs.339315 ESTs 3.1 
436043 AW963838 Hs.168830 Homo sapiens cDNA FU12136 fis. clone MA 3.1 

452823 AB012124 Hs.30696 transcription factor-fike 5 (baste helix 3.1 

405381 NA Target Exon 3.1 

428746 AW503820 Hs.192861 Spi-B transcription factor (Spi-1/PU.1 r 3.1 
50 435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f3.1 

425782 U66468 Hs.1 59525 cell growth regulatoiy with EF-hand doma 3.1 

423306 W88562 Hs.108198 ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

438581 AW977766 Hs.292133 ESTs, Moderately similar to 178885 serin 3.1 

55 417105 X60992 Hs.81226 CD6 antigen 3.0 

428361 NM_015905Hs.183858 transcriptional fnteriTiediary factor 1 3.0 

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0 
402606 NM_024626:Homo sapiens hypothetical prot 3.0 



401451 NM_004496*:Homo sapiens hepatocyte nude 3.0 

60 421878 AA299652 Hs.111496 Homo sapiens cDN A FU 11 643 fis. done HE 3.0 



409518 BE384836 Hs.3454 KIAA1 821 protein 3,0 

416933 BE561850 Hs.80506 small nudear ribonudeoprotein polypept 3.0 

414324 Y14768 Hs.890 lymphotoxin beta (TNF superfamiiy. membe 3.0 

425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0 

65 401519 C15000476*:gi|12737279|ref|XPjD12163.1| 3.0 

411704 AI499220 Hs-71573 hypothetical protein FU1 0074 3.0 

428819 AL135623 Hs.193914 KIAA0575 gene product 3.0 
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428423 
413835 
412600 
410491 
433658 
427666 
452514 
429500 
432485 
437400 
452234 
413269 
453216 
400929 
445145 
432615 
423279 
429392 
408548 
451346 
4t3109 
401714 
421462 
421750 
453293 
457085 
438930 



AU076517 

AI272727 

L28824 

AA465131 

L03678 

A1791495 

A1904898 

X78565 

N90866 

AB011542 

AW084176 

BE167526 

AL137566 



Hs.184276 

Hs.249163 

Hs.74101 

Hs.64001 

Hs.156110 

Hs.180142 

Hs^89114 
Hs.276770 
Hs.5599 
Hs.223296 

Hs.32405 



A1961702 Hs.147434 
AA557191 Hs.55028 
AW959861 Hs.290943 
AL109712 Hs.296506 
AA055449 Hs.63187 
NM_006338Hs.26312 
AW389845 Hs.1 10855 
NA 

AF016495 Hs.104624 
AK000768 Hs.ia7872 
AA382267 Hs.10653 
AA412446 Hs.98138 
AW843633 Hs.306163 



solute carrier family 9 (sodium/hydrogen 3.0 
fatly add hydroxylase 3.0 
spleen fyrosine kinase 3.0 
Homo sapiens done 25218 mRNA sequence 3.0 
Immunoglobulin kappa constant 3.0 
calmoduRn-fike skin protein 3.0 
gb:RC-BT068-130399-085 BT068 Homo sapien3.0 
hexabrachion (tenascin C, cytotactin) 3.0 
CDW52 antigen (CAMPATH-1 antigen) 3.0 
EGF-like-domain, multiple 5 3.0 
ESTs, WeaWy similar to 138022 hypotiieti 3.0 
gb:CM4-HT0509-080300-107-g07 HT0509 Homo 
Homo sapiens mRNA; cONA DKFZp585G0321 (f 
ENSP00000252232*:Steral regulatory eleme 3.0 
ESTs 3.0 
ESTs. Weakly similar to 154374 gene NF2 3.0 
ESTs 3.0 
Homo sapiens mRNA full length insert cDN 3 .0 
ESTs. Weakly similar to ALUC.HUMAN 111! 3.0 
glioma amplified on chromosome 1 protein 3.0 
ESTs 3.0 
ENSP00000241802*:CDNA RJ11007 RS, CLON 
aquaporin 9 3.0 
hypothetical protein FU20761 3.0 
ESTs 3.0 
ESTs 3.0 
hypothetk:al protein AL110115 3.0 



3.0 
3.0 



3.0 
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Table 22A shows the accession niimbers for those pkeys lacking unigenelD's for Table 22. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession*' column, 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessbns 

407980 103087_1 AA046309 AI263500 AA046397 

410785 1221055^1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355 

411743 1256098J AW862214 AW859811 AW862215 

412138 1279172 J AW895387 AW895547 AW895554 AW895323 AW895405 AW895539 AW895538 

413269 1356961J BE1 67526 BE1 67651 BE076401 R24654 

416935 163179_1 AA19071 2 AA 190665 AA252564 

422128 211994_1 AW881145 AA490718 M85537 AA304575 T06067 AA331991 

423945 233566_1 AA4 10943 AW948953AA334202AA332882 

424109 235505_1 AW406878 AW966560 AW966151 AW966495 AA336174 AA335376 AA335537 

424128 235728_1 AW966163AA335983 AA336011 AA33566B AA335973 

425331 250199 J AW962128 AA355353 AA427363 

426878 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080 

432745 353673_1 A1821926 AA658826 AA564492 AA635129 AI791191 

441153 51084_2 BE562826 BE378727 

448212 755099 1 AI475858 AW969013 

451128 859865^1 AL1 18568 D78823AI7621 76 

452514 920172 1 AI904898 AI904849 AI904899 

456207 165078.-1 AA193450 
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TABLE 22B 

Table 22B shows the genomic positioning for those pkeys lackitig unigene ZD's and 
accession numbers in Table 22. For each predicted exon, we have listed the genomic 
sequCTice source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this coiumn are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled The DNA sequence of human chromosome 22/ Dunham i. et al., Nature (1999) 402:489-495. 
Strand: Indicates ONA strand from which exons were predicted. 

Nfj>osition: Indicates nucfeotide posifions of predicted exons. 



Pkey 


Ref 


Strand 


Ntjiosition 


400814 


8569925 


Minus 


72840-72924,74761-74849 


400929 


7651921 


Minus 


122033-122241.123483-124028 


401045 


8117619 


Plus 


90044-30184.91111-91345 


401451 


6634068 


Minus 


119926-121272 


401519 


6649315 


Plus 


157315-157950 


401645 


7657839 


Minus 


34986-35133 


401714 


6715702 


Plus 


96484-96681 


401866 


8018106 


Plus 


73126-73623 


402327 


7656595 


Minus 


108675-108770.109801-109910 


402359 


9211204 


Minus 


40403-41961 


402408 


9796239 


Minus 


110326-110491 


402470 


9797107 


Plus 


195129-195776 


402542 


9801558 


Minus 


67076-67594 


402578 


9884928 


Plus 


66350-66496 


402606 


9909429 


Minus 


81747-82094 


403011 


6693597 


Minus 


3468-3623 


403212 


7630897 


Minus 


156037-156210 


403329 


8516120 


Plus 


96450-96598 


403366 


8783692 


Minus 


49323-49652 


403485 


9966528 


Plus 


2888-3001 .3198-3532,3655-41 17 


404347 


9838195 


Plus 


74493-74829 


404560 


6539738 


Minus 


240588-241589 


404755 


7706327 


Minus 


53729-53846 


405017 


6532084 


Plus 


35551-35690 


405348 


2914717 


Minus 


43310-43462 


405381 


6006920 


Minus 


7636-8054 


405801 


2924321 


Plus 


63469-63694 


405850 


6164995 


Pius 


13871-14110 


406153 


9929734 


Minus 


12902-13069 


406348 


9255985 


Minus 


71754-71944 
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TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT BREAST 

5 Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast. 
These were selected as for Table 22, except that the numerator was set to the median value 
for 12 non-malignant breast specimens, the denominator was set to the median value amongst 
the 73 breast cancers, the 90^^ percentile value amongst the 12 non-malignant breast 
specimens was greater than or equal 80 units, and the ratio was greater than or equal to 4.0 

1 0 (i. e. 4-fold down-regulated in tumor vs. normal breast) . 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

15 UnigenelD: Unigene number 

Unlgene Title: Unigene gene title 

Rl : Ratio of 50*^ percentile normal body tissue to 75^ percentile tumor 



20 Pkey ExAccn UnigenelD UnigeneTitle 



Ratio 

428722 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 22.4 

428848 NM_000230 Hs.194236 leptin {murine obesity homolog) 17.4 

445263 H57646 Hs.42586 KIAA1 560 protein 15.4 

25 418935 T28499 Hs.89485 caibonic anhydrase IV 15.0 

407228 M25079 Hs.1 55376 tiemoglobin, beta 14.6 

417511 AL049176 Hs.82223 chordin-fike 14.6 

439285 AL133916 Hs.172572 hypothetical protein FLJ20093 14.3 

412442 AI983730 Hs.26530 semm deprivation response (phosphatidyl 13.6 

30 410544 AI446543 Hs,95511 ESTs 12.6 

412047 AA934589 Hs.49696 ESTs 12 2 

422667 H25642 Hs.133471 ESTs 12.0 

406664 L34041 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 12.0 

423201 NM^000163 Hs.125180 growth homione receptor 11.7. 

35 422163 AF027208 Hs.1 12360 prominin {mouse)-like 1 10.8 

428769 AW207175 Hs.106771 ESTs 10.6 

407049 X72632 NM_021724*:Homo sapiens nuclear receptor 10.1 

412295 AWD88826 Hs.1 17176 poly{A)-binding protein, nuclear 1 9.8 

425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8 

40 406791 AI220684 Hs.272572 hemoglobin, alpha 2 9.5 

447471 AF039843 Hs.1 8676 sprouty (Drosophila) homolog 2 9.5 

451533 NM_004657 Hs.26530 senjm deprivation response (phosphatidyl 9.4 

419407 AW410377 Hs.41502 hypothetical protein FU21276 9.0 

411939 AI365585 Hs.146246 ESTs 9-0 

45 410532 T53088 Hs.155376 hemoglobin, beta fi.9 

425707 AP1 15402 Hs.1 1713 E74-like factor 5 (ets domain transcript 8.8 

416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 8.7 

443050 D78874 Hs.8944 procollagen C-^ndopeptidase enhancer 2 8.6 

435265 AA779958 Hs.185932 ESTs 8.5 

50 422511 AU076442 Hs.1 17938 collagen, type XVil. alpha 1 8,4 

433138 AB029496 Hs.59729 semaphorin sem2 8.3 

402195 NM„004497*:Homo sapiens hepatocyte nude 8.1 

429350 AI754634 Hs.131987 ESTs 8.1 

445107 A1208121 Hs.147313 ESTs, Weakly similar to 138022 hypotheti 8,1 

55 406643 N77976 Hs.272572 hemogtobin. alpha 2 8,0 

410199 AW377424 Hs.205126 Homo sapiens cDNA: FU22667 fis, clone H 8.0 

417225 AA815048 Hs.24078 hypothetical protein FIJ12649 7.8 

437569 AA760849 Hs.294052 ESTs 7.5 

436062 mSOQOZT Hs.98633 ESTs 7.5 

60 425078 NM_002599 Hs.1 54437 phosphodiesterase 2A. cGMP-slimulated 7.5 

430327 AW973636 Hs.55931 ESTs 7.4 
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447577 


AI393693 


Hs.183297 


DKFZP566F2124 protein 


7.4 


446039 


AI150491 


Hs.90756 


ESTs 


7.2 


422060 


R20893 


Hs,325823 


ESTs. Moderately similar to ALU5 HUMAN A 


7.2 


424455 


AA452008 


Hs.333199 


ESTs 


7.1 


424343 


AW956360 


Hs.4748 


adenylate cyclase activating polypeptide 


7.1 


442792 


AI352340 


Hs.131194 


ESTs 


7.0 


406714 


AI219304 


Hs,283108 


hemoglobin, gamma G 


6.9 


407571 


A)4461d3 


Hs,9572 


ESTs, Highly similar to CYA5_HUMAN ADENY 


6.8 


429580 


AA346839 


Hs,209100 


DKFZP434C171 protein 


6.7 


453500 


AI478427 


Hs,43125 


esophageal cancer related gene 4 protein 


6.7 


422233 


AB002058 


Hs.1 13275 


purinergic necepbr P2X-like 1, orphan r 


6.7 


420205 


AA256395 


Hs.88156 


ESTs 


6.6 


404368 


^4A 




ENSP00000241O75*:TRRAP PROTEIN. 


6.6 


447261 


NM 006691 


Hs.17917 


extracellular link domain-containing 1 


6.5 


417090 


AA193282 


Hs.85863 


ESTs. Weakly similar to B34612 zinc ting 


6.5 


410677 


NM 003278 


Hs 65424 


tetranectin (plasminogen-binding protein 


6.5 


427283 


AL1 19796 


Hs 174185 


ectonucleotide pyrophosphatase/phosphodi . 


.6.5 


415011 


AW963085 




gb:EST375158 MAGE resequences. MAGH Homo6.4 


412068 


S72043 


Hs 73133 


metaltothioneln 3 (growth Inhibitory fac 


6.4 


416253 


BE2S0659 


Hs.15463 


Honrio sapiens, done tMAGE:2959994. mRNA 


6.4 


435885 


AA701483 


Hs 36341 


ESTs 


6.3 


402779 


NA 




Target Exon 


6.3 


418138 


AA213626 


. Hs. 136204 


EST 


6.3 


439335 


AA742697 


Hs.62492 


ESTs, Weakly similar to B39066 proline-r 


6.3 


427019 


AAn01732 


Hs 173233 


hypothetical protein FU10970 


6.2 


411478 


BE143068 




gb:MR0-HT0158^30200^3-b09 HT0158 Homo6.2 


452654 


BE004783 




gb:MR2-BN0114-270400^04-e11 BN0114 Homo6.1 


447359 


NM_012093 


Hs 18268 


adenylate kinase 5 


6.1 


414323 


NMJ014759 


Hs.334688 


K1AA0273 gene product 


6.1 


441266 


H 15968 


Hs,293845 


Homo sapiens, done IMAGE:3502329, mRNA, 


6.1 


417011 


F08212 


Hs.23489d 


ESTs. Weakly similar to 2109260A B cell 


6.0 


400089 


NA 




Eos Control 


6.0 


433614 


W07475 


Hs.277101 


cytochrome c oxidase subunit IV isoform 


5.9 


440439 


N92818 


Hs.64754 


ESTs. Weakly similar to potential CDS [H 


5.9 


454404 


BE067414 




gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5.9 


436704 


AA062610 


Hs.1 48050 


EST 


5.9 


406563 


NA 




Target Exon 


5.9 


433490 


AW451023 


Hs.65848 


hypothetical protein DKFZp7610132 


5.9 


419313 


AA843387 


Hs.87279 


ESTs 


5.9 


409196 


NM_001874 


Hs.334873 


carboxypepGdase M 


5.8 


410882 


AW809163 




gb:MR4-ST01 18-261099-012-303 ST01 18 Homo 5.8 


453469 


AB014533 


Hs.33010 


KIAA0633 protein 


5.8 


441899 


AI372588 


Hs.8022 


TU3A protein 


5.8 


426210 


AA372052 


Hs 334559 


Homo sapiens cONA FU 14456 fis. clone HE 


5.8 


413065 


BE063555 




gb:CM1-BT0283-081 199-033-d09 BT0283 Homo 5.8 


454192 


AW87G813 


Hs.3343 


phosphogtycerate dehydrogenase 


5.7 


425187 


AW014488 


Hs.22509 


ESTs 


5.7 


429757 


AW452355 


Hs.256037 


ESTs 


5.7 


420202 


AL036557 


Hs 95910 


putative lymphocyte Q0/G1 switch gene 


5.7 


416284 


AI695473 


Hs 298006 


ESTs 


5.7 


428553 


AA181641 


Hs 184907 


G protein-coupled receptor 1 


5.6 


404689 


NA 




Target Exon 


5.6 


438887 


R68857 


Hs,265499 


ESTs 


5.6 


406082 


S47833 


Hs.82927 


adenosine monophosphate deaminase 2 (iso 


5.6 


449748 


H23963 


Hs.32043 


ESTs 


5.6 


431048 


R50253 


Hs.249129 


cell death-indudng DFFA-fike effector a 


5.5 


452205 


CI 5819 




gb:C15819 Clontech human aorta polyA mRN 


5.5 


434040 


AW444613 


Hs.288809 


hypothetical protein FU20159 


5.5 


407744 


AB020629 


Hs.38095 


ATP-binding cassette, sub-family A (ABC1 


5.5 


450606 


AI668605 


Hs.60380 


ESTs. Moderately similar to ALU6_HUMAN A 


5.5 


414629 


AA345824 


Hs.76688 


carboxylesterase 1 (monocyte/macrophage 


5.5 


401665 






CI 1000703:gi|10048448|reflNP_065258.11 g 


5.5 


436107 


T99079 


Hs.191194 


ESTs 


. 5.5 


444432 


AI161428 


Hs.75916 


splidng factor 3b. subunit 2, 145kD 


5.5 


434715 


BE005346 


Hs.1 16410 


ESTs 


5.5 


447205 


BE617015 


Hs.11008 


ESTs. Moderately similar to T17372 plasm 


5.5 


408122 


AI432652 


Hs.42824 


hypothetical protein FU10718 


5.5 
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454016 AW016806 
414913 R25621 
459033 AA017590 
441003 BE172240 
5 450637 N49826 
442398 AA994520 
403612 NA 
407102 AA007629 
410057 R66634 

10 428232 BE272452 
432769 AA620814 
431344 R99530 
427032 AF012023 
406305 BE261320 

15 437411 AW613948 
442800 AI809481 " 
402054 NA 
432085 AF212829 
415313 R59638 

20 459159 A1904646 
427164 AB037721 
441391 BE467930 
458959 AI285901 
402698 NA 

25 401810 NA 

438879 AA827674 
. 414657 AA424074 
427809 M26380 
456063 NM_006744 

30 451186 AW023469 
451882 A1821324 
402583 NA 
431130 NM.006103 
458218 A1435179 

35 416083 R53467 . 
455282 BE143867 
426488 X03350 
426156 BE244537 
407891 AA486620 

40 408610 AW026692 
445967 D59597 
434813 AI524307 
437526 A1076012 
454775 BE160229 

45 409451 AF012626 
409853 AW502327 
405062 

446490 AK000706 
417622 AW298163 

50 421978 AJ243662 
440338 R62431 
415421 R35009 
417574 R00348 
409882 AJ243191 

5 5 447998 A1768289 
445613 BE550889 
443074 AW341470 
451324 A1783600 
432433 AW014734 

60 449654 AI989812 
414519 N94587 
457531 AW973716 
433200 AA682722 
430782 AF026263 

65 427555 AW1 37094 
433545 AA868510 
420334 AI349351 



Hs.233108 ESTs 



Hs.1 29907 
Hs.126379 
Hs.18602 



Hs.9739 

Hs.26d107 

Hs.183109 

Hs.144959 

Hs.272572 

Hs.173274 

Hs.158196 

Hs.194915 

Hs.131227 

Hs.272406 
Hs.6181 

Hs.173871 
Hs.170381 
Hs.181297 



Hs.189073 

Hs.76780 

Hs.1 80878 

HSJ6461 

Hs.65256 

Hs.100445 

Hs.2719 

Hs.126820 

Hs.269122 

Hs.4 

Hs.167382 

Hs.41135 

Hs.224829 

Hs.1 18821 

Hs.162870 

Hs.121388 



gb7h45ffl6.r1 Scares placenta Nb2HP Homo 
ESTs 

ESTs, Weakly ^llar to 138022 hypoihetl 
ESTs 



5.5 
5.4 
5.4 
5.4 
5.4 



gb:ou42g09.s1 Soares_NFUJ_GBC,S1 Homos5.4 

Target Exon 5.3 

glycerol^-phosphate dehydrogenase 1 (so 5.3 

muttimerin 5.3 

monoamine oxidase A 5.3 

ESTs 5.3 

hemoglobin, alpha 2 5.3 

integrin cytoplasmic domain-associated p • 5.3 

transcriptional adaptor 3 (ADA3, yeast h 5.3 

ESTs 5.3 

ESTs 5.3 

Target Exon 5.3 

potassium channel, subfamily K. member 9 5.3 

ESTs 5.2 

gb:QV-BT055-020399-103 BT065 Homo sapien 5.2 

KIAA1300 protein 5.2 

ESTs 5.2 

ESTs 5.2 

ENSP00000251335*:DJ1003J2.1 {sodium and 5.2 

Target Exon 5.2 

ESTs 5.2 

protein phosphatase 1. regulatory (inhib 5.2 

lipoprotein lipase 5.1 

retinol-binding protein 4, interstitial 5.1 

ESTs, Weakly similar to leudne-rich gli 5.1 

ESTs 5.1 

NML021620:Homo sapiens PR domain contain .5.1 

HE4; WFDC2: putative ovarian carcinoma m 5.1 

ESTs 5.1 

ESTs, Weakly simiter to ALU1_HUMAN ALU S 5.1 
gb:MRO-HT0164-070100-013-h02 HT0164 Homo5.1 

alcohol dehydrogenase 16 (class I], beta 5.1 
natriuretic peptide receptor A/guanylats . 5.1 

endonnucin-2 5.0 

ESTs 5.0 

CGI-62 protein 5.0 

ESTs 5.0 
ESTs, Weakly similar to MDHC^HUMAN MALAT 5.0 
gb:QV1-HT0413-090200^62-a12 HT0413 Homo 5.0 



Hs.54472 


fragile X mental retardation 2 


5.0 




gb:UI.HF-BR0p-aka-a-07-0-Ul.r1 N1H_MGC_5 


5.0 




Target Exon 


5.0 


Hs.15125 


hypothetical protein FLJ20699 


5.0 


Hs.82318 


WAS protein family, members 


5.0 


Hs.1 10196 


NlCE-1 protein 


5.0 


Hs.12758 


ESTs 


5.0 


Hs.24903 


ESTs 


5.0 




gb:ye69e06.rl Soares fetal liver spleen 


5;0 


Hs,56874 


heat shock 27kD protein family, member 7 


5.0 


Hs.304389 


ESTs 


4.9 


Hs.158491 


ESTs 


4.9 


Hs.144907 


ESTs 


4.9 


HS208052 


ESTs 


4.9 


Hs.157969 


ESTs 


4.9 


Hs.199850 


ESTs 


4.9 


Hs.55063 


ESTs 


4.9 


Hs.13913 


K1AA1577 protein 


4.9 


Hs.192725 


ESTs 


4.9 


Hs.247920 


cholinergic receptor, muscarinic 5 


4.8 


Hs.97990 


ESTs 


4.8 


Hs.1 12496 


ESTs 


4.8 


Hs.1 18944 


hypothetical protein FLI22477 


4.8 
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421795 X63094 Hs783822 Rhesus blood group. D anfigen 4.8 

427138 N77624 Hs.173717 phosphatidic add phosphatase type 2B 4.8 

413072 BE063955 gb:QV3-BT0296-140200-085-h01 BT0296 Homo 4.8 

443721 AW450451 Hs.266355 ESTs 4.8 

5 408053 AW139474 Hs.246862 ESTs 4.8 

427067 AA843716 Hs.177927 ESTs 4.7 

442969 A1025499 Hs.132238 ESTs 4.7 

426220 AI383475 Hs.171697 ESTs. Weakly similar to T 13924 sdkprote 4.7 

4 14593 BE386764 gb:601 273249F1 N1H_MGC^20 Homo sapiens c 4.7 

10 426893 AA398716 Hs.97418 ESTs 4.7 

434046 AW292618 Hs.1 13011 ESTs 4.7 

401590 NA TaigetExon 4,7 

457971 AW134679 Hs.242849 ESTs 4.7 

427722 AK000123 Hs.180479 hypothetical protein FU201 16 4.6 

15 443793 AA045290 Hs.25930 ESTs, WeaWy similar to 2109260A B cell 4.6 

407737 R49187 Hs.6659 ESTs 4.6 

441955 AA972327 Hs.142903 ESTs 4.6 

441499 AW298235 Hs.101689 ESTs 4.6 

447517 A1382726 Hs.182434 ESTs 4.6 

20 403017 Target Exon 4.6 

450580 N40087 Hs.15248 ESTs 4.6 

404611 H58589 Hs.35156 Homo sapiens cDNAFUl 1027 fis. done PL 4.6 

414831 M31158 Hs.77439 protein kinase, cAMP^ependent. reguteto 4.6 

459290 NM_001546 Hs.34853 inhibitor of DNA binding 4. dominant neg 4.6 

25 444341 AI142027 Hs.146650 ESTs 4.6 

408614 AL137698 Hs.46531 Homo sapiens mRNA; cDNADKFZp434C 1915 (f 4. 6 

449638 AW204277 Hs250723 hypothetical protein MGC2747 4.6 

434418 AF1 34707 Hs.278679 a dislntegrin and metalloproteinase doma 4.6 

447360 AI375984 Hs.167216 ESTs 4.6 

30 419583 F00312 gb:HSBB0D101 STRATAGENE Human skeletal m4.6 

440698 AI348455 Hs.147492 Homo sapienScDNAFLJ11777fis. done HE 4.6 

451199 AI290653 Hs.124758 ESTs 4.6 

438338 NM^014861 Hs.6168 KIAA0703 gene product 4.6 

433756 AW015933 Hs.112654 Homo sapiens, done MGC:9764, mRNA, cbmp 4.5 

35 423301 S67580 Hs.1645 cytochrome P450. subfamily IVA. polypepl 4.5 

417237 H86385 Hs.81737 palmitoyl-protein thioesterase 2 4.5 

439745 AL389981 Hs.149219 Homo sapiens mRNA fulllength insert cDN 4.5 

424137 AA335769 - Hs.16262 ESTs 4.5 

449338 H73444 Hs.394 adrenomedullln 4.5 

40 434744 N94835 Hs.283828 Homo sapiens genomic DNA, chromosome 21 q 4.5 

407402 AF035303 gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

443510 NM 012190 Hs.9520 fomiyltetrahydrofoiate dehydrogenase 4.5 

415754 AA169114 Hs.12247 hypothetical protein FU1141 3 4.5 

415986 Z43619 gb:HSC1GE121 normalized infant brain cDN 4.5 

45 457416 BE142052 Hs.62654 kringle-containing transmembrane protein 4.5 

418064 BE387287 Hs.83384 . SlOO.caldum-binding protein, beta (neur 4.4 

437120 AI356125 Hs,157767 ESTs. WeaWy similar to HXA2_HUMAN HOMEO 4.4 . 

453950 AA156998 Hs.211568 eukaryotic translation initiafion ^ctor 4.4 

401093 C12000586*:gi|6330167|dbj|BAA86477.1|(A 4.4 

50 436935 AW206494 Hs.253560 ESTs 4.4 

457974 AW842353 Hs.321717 ESTs, Weakly similar to S22765 heterogen 4.4 

428222 AL133112 Hs.183085 Homo sapiens mRNA; cDIMA DKFZp434K098 (fr 4.4 

442705 AI264634 Hs.131127 ESTs 4.4 

437409 AL359599 Hs.283850 Homo sapiens mRNA; cONA DKFZp547C1 26 (fr 4,4 

55 458494 AI380906 Hs.158436 ESTs 4.4 

410490 H03589. gb:yj42d08.r1 Soares placenta Nb2HP Homo 4.4 

416069 R37101 Hs.20982 ESTs 4.4 

438463 AA807958 Hs.314232 ESTs 4.4 

444043 AI499723 Hs.135089 ESTs 4.4 

60 438327 H87407 Hs.172944 chorions gonadotropin, beta polypeptide 4.4 

457711 AF147401 Hs.23917 ESTs 4.3 

400870 C11000905:gi|11692565|gblAAG39879.1lAF28 4.3 

441425 AA933590 Hs.28937 homeobox protein from AL590526 4.3 

416267 H45384 gb:yn99c10.r1 Soares adult brain N2b5HB5 4.3 

65 407262 Ml 2873 gbiHuman Ig reananged H-diain mRNA VDJ4 4.3 

444567 AV654020 Hs.184261 ESTs, WeaWy similar to T26686 hypotheti 4.3 

403263 NA TaigetExon 4.3 
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410034 BE067414 gb:MR4-BT0355-200100.201-e05BT0355Homo4.3 

456804 AI421645 Hs.139851 caveoiin2 4.3 

448427 BE395260 Hs.309438 EST 4.3 

416931 D45371 Hs.80485 adipose most abundant gene transcript 1 4.3 

S 421296 NM.002666 Hs.1032S3- periGpin 4.3 

400973 NA ENSP00000236667*:Mucin 58 (Fragment). 4.3 

452602 AW366194 Hs55962 ESTs 4.3 

412330 NIVLOOSlOO Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3 

405016 CY00D17r:gil928O4O51gblAAF88402.1|AF245 4.3 

10 435104 AI475671 Hs,88607 ESTs, Highly similar to F-box protein FB 4.3 

405118 ENSP00000246632:CDNAFU20261fis. clone 4.3 

418556 T02850 gb:FB12A9 Fetal brain. Stratagene Homo s 4.3 

429745 AA480818 Hs.221736 ESTs 4.3 

433088 AW451206 Hs.115899 ESTs 4.3 

15 444445 AA342329 Hs.1 15920 Homo sapiens cDNA: FU22816 fis. clone K 4.3 

453880 AI803166 Hs.28462 ESTs. Weakly similar to 138022 hypotheti 4.3 

447384 AI377221 Hs.40528 ESTs 4.2 

414541 BE293116 Hs.76392 aldehyde dehydrogenase 1 family, member 4.2 

444975 AV652165 Hs.l82482 ESTs, Weakly similar to T00362 hypotheti 4.2 

20 403921 NA C5000212*:gi|10047237|db]|BAB13407.1|(A 4.2 

451477 A1798425 Hs,42710 ESTs 4.2 

406344 C5001660:gi|11611537ldbjlBABl8935.1|(AB 4.2 

416970 AA191201 Hs.35861 . DKFZP586E1 621 protein 4.2 

413662 BE155866 Hs.25522 KiAA1808 protein 4.2 

25 458504 AW070634 Hs.144794 ESTs 4.2 

404682 NA C9001188*:gi|12738842lreflNP_073725.1| p 4.2 

418089 N69913 Hs.6858 ESTs. Weakly similar to 178885 serine/th 4,2 

403433 NM_001622:Homo sapiens alpha-2-HS-glycop 4.2 

446532 AW975460 Hs.143563 ESTs 4,2 

30 414217 AI309298 Hs.279898 Homo sapiens cDIMA: FU231 65 fis, done L 4,2 

418425 AI871247 Hs.6262 hypotiietical protein IVIGC8407 4.2 

419589 AW973708 Hs.201925 Homo sapiens cDNA FLJ13446 fis. clone PL 4.2 

457029 AA397789 Hs.161803 ESTs 4,2 

447860 AF193807 Hs.1 31835 Rhesus blood group. B glycoprotein 4.2 

35 448988 Y09763 Hs.22785 gamma-aminobutyric add (6ABA) A recepto 4.2 

440610 AI733098 Hs.130800 ESTs 4.2 

439590 AF086410 gb:Homo sapiens full length insert cDNA 4.2 

427240 AA399975 Hs.274151 ligatin 4.2 

408932 AW594172 Hs.278513 TP53TG3 protein 4.2 

40 436112 T77545 Hs,187559 ESTs 4.2 

444382 AI144152 Hs.58246 ESTs 4,2 

456716 AA318060 Hs.135121 hypothetical protein FLJ224 15 . 4.2 

419846 NM_015977 Hs.285681 Williams-Beuren syndrome chromosome regi 4.2 

410036 R57171 Hs.57975 calsequestrin 2 (cardiac muscle) 4.1 

45 400545 NA Target Exon 4.1 

403051 NA Target Exon 4.1 

420139 NM_005357 Hs.95351 lipase, hormone-sensitive ' 4.1 

450244 AA007534 Hs.1 25062 ESTs 4.1 

453261 AA034116 Hs.118494 ESTs 4.1 

50 440246 W52010 Hs.191379 ESTs 4.1 

414516 AI307802 Hs.135560 ESTs. Weakly similar to T43458 hypotheti 4,1 

438232 A1150595 Hs.122226 ESTs 4.1 

410233 AA082947 gb:znl0g07.s1 Stratagene hNT neuron (937 4.1 

412179 BE270758 Hs.69428 hypothetical pnDtein MGC3020 4.1 

55 441871 AI306150 Hs.153450 ESTs, Weakly shnilar to 19091 23ANa glue 4.1 

426411 AK000708 Hs.169764 hypothetical protein FU20701 4.1 

453692 AL1 10416 gb:DKFZp434K0431_r1 434 (synonym: htes3) 4.1 

448640 AW817177 Hs.102558 . Homo sapiens^ done MGC:5352. mRNA, comp 4.1 

417481 AA203281 Hs.21798 ESTs 4.1 

60 412912 AW118878 Hs.1 10835 ESTs 4.1 

454183 AW807116 gb:MR4-ST0062-O4010O4)24-b12 ST0062 Homo 4.1 

426328 AW631296 gb:hh83c09.y1 NCLCGAP.GUI Homo sapiens 4.1 

435942 R08285 Hs.191215 ESTs 4.1 

417629 T76945 gb:yc92o07.r1 Scares infant brain 1NBH 4.1 

65 403593 NA Target Exon 4,0 

402690 Target Exon 4.0 

418190 R49591 Hs.270425 ESTs 4.0 
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408641 


'AW245207 


Hs.5555 


hypothetical protein MGC5347 


4.0 


427899 


AA829286 


Hs.332053 


senim amyloid A1 


4.0 


445975 


AI811536 


Hs.145734 


ESTs 


4.0 


438831 


BE263273 


Hs.6439 


synapsin II 


4.0 


455578 


BE006350 


Hs.14355 


Homo sapiens cDNA FLJ13207 fis. done NT 


4.0 


401840 


NA 




Target Exon 


4.0 


413753 


U17760 


Hs.75517 


lamtnin, beta 3 (nicein (125kD). kaDnin 


4.0 


445030 


AI205925 


Hs.147238 


ESTs. Highly similar to AAC3_HUMAN ALPHA 


4.0 


433873 


AW156913 


Hs. 150478 


ESTs, WeaWy sMar to A Cham A, Cryst 


4.0 


456736 


AW248217 


Hs.1619 


achaete-scute c»mplex (Drosophila) homol 


4.0 


450112 


BE047734 


Hs.5473. 


ESTs. Moderately similar to ALU5.HUMAN A 


4.0 


448906 AI58d567 


Hs.309719 


ESTs 


4.0 
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TABLE 23A 
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Table 23 A shows the accession numbers for those pkeys lacking unigenelD's for Table 23. 
For each probeset, we have Hsted the gene cluster number from which the oligonucleotides ■ 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland CaUfomia). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset Identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

409853 1156226,1 AW502327 AW502488 AW501829 AW502625 AW502687 

410034 1 170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 

410233 118656_1 AA082947 AA083D36 

410490 1205347 1 H03589 AW750687 AW750688 

410882 1225686.1 AW809163 AW809247 AW809177 AW809190 AW809225 

411478 1247073 1 BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848005 
413065 1347960ll BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 

413072 1348163 1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE063884 

414593 1464909^1 BE386764 BE387560 

414913 1506721 1 R25621 C03959 C04010 

415011 151328J AW963085AA159005AW963073 

415986 1564410_1 Z43519 R61274 H12206 R12883 

416267 1583547J H45384 H49125 H41699 

417574 1687770_1 R00348 R09593 

417629 1690392.1 T76945 R20210 R05755 

418556 1767866_-1 T02850 

419583 186198J F00312 AA247490 F31427 AA383663 F22045 

426328 264901J AW631296 AA375484 

439590 47413J AF086410 W94386 W74609 

442398 541271J AA994520 AW393574 

452205 90415_1 C15819 AA024741 AA024742 

452654 925931J BE004783 BE004947 A1911790 

453692 977825 1 AL1 10416 AW876759 

454183 1049636J AW807116 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
BE141569 AW807401 AW807310 BE141565 AW807318 AW807119 AW807299 AW807241 AW807225 AW807204 AW807345 
AW807103 BE141 615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AWB07220 AW807399 
AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW807112 AW807378 AW807202 BE141593 AW807216 
AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW80731 1 AW607214 AW807132 AW807402 AW807350 
AW807028 AW807298 AW807291 AW807305 AWB07217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076AW807314 AW807213 AW807143AW807131 
AW800079AW807066 AW807077 AW845923AW807421 BE141619 BE1 40943 AW807420 AW807381 BE141621 AW845921 
BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 

454404 1170594J BE067414 BE067958 BE057419 BE067963 AW577127 AW601412 

454775 1234106.1 BE160229 AW819879 AW820179AW819882 AW819876AW820169BE153201 AW993736BE152911 
• 455282 1273020.1 BE 1 43867 AW935060AW886684 
459159 919998J AI904646 BE179494 BE179421 
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TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et at." refers to the publication 

entitled The ONA sequence of human chromosome 22." Dunham I. et at., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: lndk:ates nudeotkle positions of predicted exons. 



Pkey 


Ref 


Strand 


N(_posftion 


400545 


9800107 


Minus 


124618-124881 


400870 


9838306 


Minus 


34081-35027 


400973 


7960452 


Minus 


98119-98253 


401093 


8516137 


Minus 


22335-23166 


401590 


9966320 


Minus 


33547-33649 


401665 


7145001 


Plus 


121591-122537 


401810 


7342191 


Plus 


129063-129476 


401840 


7684597 


Plus 


56283-56439 


402054 


8083691 


Minus 


8288-8806 


402195 


7689778 


Minus 


147901-148884 


402583 


7684486 


Plus 


94883-95003 


402690 


8348058 


Plus 


13368-13998 


402698 


8570304 


Minus 


108641-108903 


402779 


9588555 


Minus 


38173-39210 


403017 


6693623 


Plus 


78630-79367 


403051 


4827080 


Minus 


5269-5411 


403263 


7770677 


Plus 


52431-52737 


403433 


9719611 


Minus 


72225-72437 


403593 


6662650 


Minus 


62554-62712,69449-69602 


403612 


8469060 


Minus 


94723-94859 


403921 


7711590 


Minus 


3297^536 


404368 


7630956 


Minus 


102053-102199 


404682 


9797231 


Minus 


40977-41150 


404689 


7534100 


Plus 


119461-119717 


405016 


6524300 


Plus 


51997-53308 


405062 


7657730 


Plus 


101283-101432 


406118 


9143818 


Plus 


53997-54629 


406344 


9255974 


Plus 


20254-20374,20526-20659, 


406563 


7711604 


Plus 


34401-34538 
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TABLE 24: 

Table 24 depicts Seq ID No., UnigenelD, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 
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Pkey: Unique Eos probeset Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Pred.Cell.Loc: Predicted Cellular Localization 

Seq.lD.No.: Sequence Identification Number found in Table 25 

Pkey ExAccn UnigenelD UnigeneTitle 



Pred.CeU.Loc. Seq. ID. No. 



449746 

407276 

415539 

400297 

450375 

102457 

429170 

424399 

422505 

449765 

425692 

426215 

439840 

410102 

429220 

416276 

409079 

442818 

442082 

444381 

446163 

416636 

442117 

433043 

429353 

452190 

446733 

452747 

423242 

417433 

432201 

423961 

439569 

114480 

404561 

325372 

112287 

335824 

424735 

400289 

427585 

429925 

429441 

421155 

420931 

420813 

452744 



AI668594 
AI951118 
AI733881 
All 27076 
AA0Q9647 
NM_001394Hs.2359 
NM_001394Hs.2359 
AI905687 Hs.2533 
AL1 20862 
N92293 
D90041 
AW963419 Hs.155223 
AW449211 Hs.105445 
AW248508 Hs.279727 
AW207206 Hs.136319 
U41060 



Hs. 176588 

Hs.326738 

Hs.72472 

Hs.334473 

Hs.8850 



Hs.124165 
Hs.206832 
Hs.155956 



W87707 

AK001741 

R41823 

BE387335 

AA026880 

N32536 



Hs.79136 
Hs.82065 
Hs.8739 
Hs.7413 
Hs.283713 
Hs.25252 
Hs.42545 



AW664964 Hs.1 28899 
W57554 Hs.125019 



AL1 17406 

H26735 

AA863360 

BEl 53855 

AL039402 

BE270266 

AI538613 

D13666 



Hs.200102 

Hs.91668 

Hs,26040 

Hs.61460 

Hs.125783 

Hs.82128 

Hs.298241 

Hs.1 36348 



AW602166 Hs.222399 

BE066778 Hs.151678 

NA 

AB033064 Hs.334806 
NA 

U31875 Hs.272499 

X07820 Hs.2258 

031 152 Hs.179729 
NM^000786Hs.226213 

AJ224172 Hs.204096 

H87879 Hs.102267 

AF044197 Hs.100431 

X51501 Hs.99949. 

AI267652 H5.30504 



ESTs. WeaWy similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-RIB 

hypothetical protein DKFZp56401278 

a disintegrin and metalloproteinase doma 

dual spedficity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar (o ALU8,HUMAN A 

N^acetyttransferase 1 (arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FLJ14035 fts, done HE 

ESTs 

LIV-1 protein, estrogen regulated 
interleukin 6 signal transducer (gpl30, 
hypothetical protein FU10879 
ESTs 

ESTs. Weakly similar to S64054 hypotheli 
Homo sapiens cDNA FU 13603 fis, clone PL 
solute carrier family 16 (monocarboxyfic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs. Weakly similar fo fatty add omega 
Ig superfamily receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast spedfic factor 2 (fasciclin 
CEGP1 protein 

UDP-N-acetyl-aIpha-D-galactosamine:pofyp 
NM.014112*:Homo sapiens tridiortiinophatei mitochodria 
Phase2&3Exons nuclear 
KIAA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short^in afoohol dehydrogenase family 
matrix metaUoproteinase 10 (stromelysin 
collagen, type X, alpha 1 (Schnud metaph 
cytochrome P450« 51 (ianosterol 14-a|pha ER 
iipophilin B (uterogtobin family member) 
lysyl oxidase extracellular 
small indudble cytokine B subfamily (Cy 
prolactln-induced protein nuclear 
Homo sapiens mRNA: cDNA DKFZp434E082 (fr 
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Seq ID1&2 
Seq ID3&4 
Seq ID5&6 
Seq ID7&8 
SeqlOg&IO 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15&16 
Seq ID 17&18 
Seq ID 19&20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq ID 27&28 
Seq ID 29 & 30 
Seq ID 31&32 
Seq ID 33&34 
Seq ID 35 & 36 
Seq ID 37 & 38 
Seq ID 39&40 
Seq ID 41 & 42 
Seq ID 43 fit 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
SeqlD51&52 
Seq ID53&54 
Seq ID 55&56 
Seq ID 57 & 58 
Seq ID59&60 
Seq ID 61 & 62 
Seq ID63&64 
SeqlD65&66 
Seq[D67&68 
Seq ID69&70 
Seq ID 71 & 72 
SeqiD73&74 
Seqf075&76 
Seq 1D77&78 
Seq ID 79&80 
Seq ID 81&82 
Seq ID 83 & 84 
Seq ID 85 885 
SeqlD87&88 
Seq ID 89 & 90 
Seq ID 91&92 
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420757 X78592 Hs.99915 

424905 N[\l.002497Hs.153704 

429859 N{\/L007050Hs.225952 

446921 AB012113 Hs.16530 

5 445537 AJ245671 Hs.12844 

428227 AA321849 Hs.2248 

424001 W67883 Hs,137476 

421727 Y13153 Hs.107318 

452838 U65011 Hs.30743 

10 419667 AU077005 Hs.92208 

414812 X72755 Hs.77367. 

426320 W47595 Hs.169300 

422867 L32137 Hs.1584 

411558 AA102670 Hs.70725 

15 417866 AW067903 Hs.82772 

428398 AI249368 Hs.98558 

431958 X63629 Hs.2877 

428722 U76456 Hs.190787 

412970 AB026436 Hs.i77534 

20 421379 Y15221 Hs.103982 

415752 BE314524 Hs.78776 

444051 M48373 Hs.10247 

451110 AI955040 Hs.265398 



androgen receptor (dihydrotestosterane r 
NIMA (never in mitosis gene aHelated k 
protein tyrosine phosphatase, receptor t 
small inducible cytokine subfami^ A (Cy 
EGF-like-domain, ifnultiple 6 
small inducible cytoldne subfanvly B (Cy 
patemally expressed 10 
kynurenine 3^nonobxygenase (kynurenine 3 
preferentially expressed antigen in mela 
a disintegrin and metalloproteinase doma 
monokine induced by gamma Interferon 
transfomiing growth factor, beta 2 
cartilage oligomeric matrix protein (pse 
gamma-aminobutyric add (GA6A) A recepto 
collagen, type XI. alpha 1 
ESTs 

cadherin 3. type 1, P-cadherin (placenta 
tissue inhibitor of metalloproteinase 4 
dual speciffdty phosphatase 10 
smafl tndudble cytoldne subfamily B (Cy 
putative transmembrane pnstein 
activated leucocyte cell adheskm molecu 
ESTs. Weakly similar to transfonnation-r 



cytoplasm 
nudear 



Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 

plasma membraneSeq 
Seq 
Seq 
Seq 
Seq 
Seq 
Seq 



extracellular 
extraceliular 



nudear 

extracellular 
extracellular 
extraceliular 



extracellular 



ID93&94 
ID95&96 
10 97 & 98 
ID 99 & 100 
ID 101 & 102 
ID 103 & 104 
ID 105 & 106 
ID 107 & 108 
ID 109 & 110 
ID 111 & 112 
ID113&114 
ID115&116 
ID117&118 
ID 119 & 120 
ID 121 & 122 
ID 123 & 124 
ID 125 & 126 
ID 127 & 128 
10 129 & 130 
ID 131 & 132 
ID 133 & 134 
ID 135 & 136 
ID 137 & 138 
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TABLE 24A 

Table 24 A shows the accession numbers for those pkeys lacking unigenelD's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
5 were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and niRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

10 Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

IS Pkey CAT number Accession 

335824 CH22^3197FG_619J1_LINK_E 
325372 c12_hs 



BNSOCxna<wo aao5S377»2 1 
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TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ZD's and 
accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



10 



15 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbe/s. "Dunham J. et al." refers to the publication 

entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1 999) 402:489-495. • 
strand: Indicates DNA strand from which axons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand 
404561 9795980 AfTinus 



Kt_positlori 
69039-70100 



20 



BNSDOCIO: <WO^__0aQS9977A2J_> 
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Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs m 
Genbank). The sequences may be used to eitlier identify genes tJiat encode known proteins, or they may be used to predict the coding regions from genomic 
DNA using exon prediction algoritlmis, such as FGENESH (Saiamov and Solovyev, 2000, Genome Res. 10:S 16-522) 

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I 1 

ATGG AGCCCT CCTGGCTTCA GG AACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGG AGGAG ATGG ATGATC 1 20 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 1 80 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCX:AAG 300 
ATTCTCCTGA AAAGACA AG A TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TG AATCCTGG 360 
GTTGGTCGAG G ACTTGTG AC CCTGG ATGGT TCTA A ATGG A AAAAGCACCG CCAGATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TG AGAGTGTT 480 
CGGATGATGCTGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCTGGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACG at GTGCATCAAG G a ATGCCTCC GCCTCTACGC ACCGGTAGTA U 40 
AACATATCCC GGTTACTCGA CAAACCCATC.ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT- 1320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCA TGTG TTT 1500 
GCAAAAAAAG TTTGC TAAT T TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG TTGIT TGACA 1620 
AATTATATAA CrrAGGATACrrCTGACTGGTTTTGACATCCATTAACAGT AATTTTAATT 1680 
TCTTTGCTGT ATCTGGTGAA ACCCACAAAA ACACCTG AAA AAACTCAAGC TGACTTCCAC 1740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 
ATCA AAACTC CACTCAGTAT CTGC ATTACT TTTATCTCTG CAAATATCTG CATG ATAGCT 1 860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 II 21 31 41 51 

' I ' » ' 1 ' ^ 
MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 . 

YPVKEFEVYH KJLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKJLESW 120 

VGRGLVTLDG SKWKKHRQIV KPGFNISILK IFITMMSESV RMMLNKWEER lAQNSRLELF 180 

QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 

SQGQIFSKFN QELHQFTEKV IQDRKESUCD ICLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 

EADLQAEVKT FMFAGHDTTS SAISWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 

QMPYTTMCDC ECLRLYAPVV NISRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 

PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAn ECKVAVALTL LRFKLAPDHS 480 

RPPQPVRQVV UCSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM_052997 

Coding sequence: 1 00-4 1 25 (underlined sequences correspond to start and stop codons) 



I 11 21 31 41 51 
I I » ' ' ' 

CTAGTCTATA CCAGCAACG A CTCCTACATC GTCCACTCTG GGG ATCTTAG AAAGATCCAT 60 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAG AAG AGG ACTGCTC TACACTGGGC CTGTGTCAAT 1 80 
GGCCATGAGG AAGTAGTAAC ATTTCTGGTA GACAGAAAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTQATAG ATTCTGGTGC CG ATAT AAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTGA GAITTTGTCA GTGGTGGCAA AACTGCTGTC CCATQGTGCA 420 
GTCATCGAAG TGCACAACAA GGCTAGCCTC ACACCACTTT TACTATCCAT AACG AA AAGA 480 
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AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCXTTCAT GCTTGCTGTA TGTCATGGAT CATCAGAG AT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTGA CGTCTTTGCT GCAGATATAT GTGGAGTAAC TGCAG AACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATG AACAAA TTATGGAATA TATACGAAAA 720 
5 TTATCTAAAAATCATCAAAATACCAATCCAGAAGG AACAT CTGCAGGAACACCTGATGAG 780 
GCTGCACCCT TGGCGGAAAG AACACCTGAC ACAGCTG AAA GCTTGGTGG A AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG. 960 
AAAGCGACAT CTGG AAAGTT CG AACAGTC A GCAGAAGAAA CACCTAGGG A AATTACG AGT 1 020 

1 0 CCTGCA AA AG A A ACATCTG A GAAATTTACG TGGCCAGCA A AAGG AAGACC TAGGA AG ATC J 080 
GCATGGGAGA AAAAAGAAGA CACACCTAGG GAAATTATGA GTCX^CGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGGAAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGaA 1260 
' AAAGG AAGAT CTAAG ATGAT TGCATGTCCT ACAAAAG AAT CATCTACAAA AGCAAGTGCC 1 320 

1 5 AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGGAGTC TCTTTGAGAG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATG AA 1 560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 

20 AATTCTTGGG ATTCTGAGAG TCTCTGTG AG ACTGTTTCAC AG AAGG ATGT GTGTTTACCC 1 680 

AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATGGAA AATTAGAAG A GTCTCCTA AT 1 740 
AAAG ATGGTC TTCTG AAGGC TACCTGCGGA ATGAAAGTTT CTATTCCAAC TAAAGCCTTA 1 800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGCCTTCGAG 1860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 

25 ACATGGAGAG CAGATGAGAT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGG ATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA .^TGGAAAAT TAGAAGGGTC TCCTGTTA AA 2 1 00 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTG ATGG ACA TGCAAACTTT CAAAGCAG AG CCTCCCGAGA AGCCATCTGC CTTCG AGCCT 2220 

3 0 GCCATTG AAA TGCAAA AGTC TGTTCCAAAT AAAGCCTTGG A ATTG AAG AA TG AACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAGA TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTGA AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 

3 5 ATGG ACATGC AA ACTTTCAA AGC AG AGCCT CCCG AG AAGC C ATCTGCCTT CG AGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCAAATAAA GCCTTGGAAT TGAAGAATGA ACAAACATTG 2640 
AGAGCAGATCAGATGTTCCCTTCAGAATCAAAACAAAAGAAGGTTGAAGAAAATTCTTGG 2700 
GATTCTGAGA GTCTCCGTGA GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 

40 ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG G AAAAATGG A ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAGAAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 300O 
CTCTGCAGTG TGAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 

45 GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAG AATT GAAGAGTGTA 3 1 80 
GAAAGTAATT TG AATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TG AAAAAGG A AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
_. GCTGAACTTC AG ATGACCCT AAAACTGAAA G AGGAATCAT TAACTAAAAG GGCATCTCAA 3420 

5 0 TATAGTGGGC AGCTTAAAGT TCTGATAGCT G AGAACACAA TGCTCACTTC TAAATTGAAG 3480 
GAAAAACAAG ACAAAGAAAT ACTAGAGGCA GAAATTG AAT CACACCATCC TAGACTGGCT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AAAGTCAAGA ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTAGGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 

5 5 AAAATTAATC TCAATTATGC AGGAGATGCT CTAAGAGAAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAGAG ACCAACGTGA AACACAGTGT CAAATGAAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAG AAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAG AAA 3960 
^ GCTGAC AACA AAAGCAAG AT A AC AATTGAT ATTCATTTTC TTG AGAGG AA AATGCAACAT 4020 

60 CATCTCCTAA AAGAGAAAAA TGAGGAG ATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 

ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACt C ATGAG AGAC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TG AAAATCTT ACCAATAGTC TGTGTCAACA G AATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAA ATA ACAGTGTGAA 4320 

65 G AATTACTTG TTCACGAATT GCATAAAGCT GCACAGGATT CCCATCTACC CTGATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



70 5^q ID Nn: 4 Protein sequence: 

Protein Accession NP_443723.1 

1 U 21 31 41 51 

I I 1 I i I 
7 5 MTKRKKTINL NIQD AQKRTA LHWACVNGHE EV VTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGADINLVD VYGNMALHYA VYSEILSVVA KLLSHGAVIE VHNKASLTPL 120 
LLSnXRSEQ IVEFLLIKKA NANA VNKYKC TALMLA VCHG SSEI VGMLLQ QNVDVFA ADI 1 80 
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CG VTAEHYAV TCGFHHIHEQ IMEYIRKLSK KHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KQRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
lAWEKXETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CIPESIYQICV MEINREVEEP PKKPSAFKPA lEMQNS VPNK 480 
AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKING 540 
KLEESPNKDG LLKATCGMKV SIPTKALEUC DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEIDKINGK 660 
LEGSPVKDGL LKANCX3MKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEKl PSAFEPAIEM QKSVPNKALE 840 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE SLRETVSQKD VCVPKATHQK EMDKISGKLE 900 
DSTSLSKILD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEIKSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREELGRIE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMUCL EIATUCHQYQ EKENKYFEDI 1080 
KILKEKNAEL QMTLKLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1 140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQKEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKITIDIHF LERKMQHHLL KEKNEEIFNY 1320 
NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: S DNA sequence 

Nucieic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences coirespond to start and stop codons) 



1 11 21 31 41 51 

1111)1 

CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTG A A AGG A A AGG AAG ATCA TTTCATGCCT TGTTG ATAA A Ci r3TTCAG ACT TCTGCTG ATT 1 80 
CATAACCATT TGGCTCTGAG CTATG ACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCGAAGTGC AGGAAAATTA 300 
AATGTGGGCA CCAAGAAAG A GG ATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACG AT GATAG AAG AG GATG ACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG G ACT AG A AGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AG AAG ATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT ACACCCTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCACAG GGCTTTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGO 720 
TATAAAAG AC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAG AACAGG A TGAAACrTAC 780 
ATTCCTCCTQ GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTITGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTG AAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTGAAGTCCACCACCCT AGACGCTAAATCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 
AATCACTTCC AGTCTTACAT CATGGCTGAC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1 500 
GTTGCTAGQA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGGAGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAA ACCG GTGGAGCAGT GATGAGTGTC TA AGGCAGAT GGGAAAACTC 1 680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAG A GAAGCTCCCA GAAGGAGAG A TTGATCCGTG 1980 
TCTGTTTGTA GGCGG AG AAA CCGTTGGGTA ACTTGTTCAA G ATATGATGC AT 



Seq XD NO: 6 Protein sequence: 
Protein Accession #: none found 
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I I I I I I 

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60 
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120 
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 1 80 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVA V KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLA YSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420 
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EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSPPNRWSSD ECLRQMGKLM TECWAHNPAS 480 
RLTALRVKKT LAKMSESQDl KL 

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession none found 

Coding sequence: 482-3007(underlined sequences cotrespond to start and stop codons) 



1 U 21 31 41 SI 
1)1)1! 

AACTGAGCTA ACAAGAAATA CTAG AAAAGG AGG AAGGAGA ACATTGCTGC AGCTTGGATC 60 
. TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 1 80 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTA GTAT GCA 240 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TG ACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAG ATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
C ATG AAGCTG TGG ATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAG ATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGG AAA ACCTGG AATT 900 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTG CCTTA 1080 
TGTTGGTTTT CTCG AACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1 1 40 
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAOCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1 500 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 
TCCTCCGCAA AATCCTAG AA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAA TGCCT AA 1860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTnTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 
CTGCTCCTGT GACCTGGTrC GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 
TATTGTTTTC TGTGCTGCAG GG ATAGTGGT TCTTGTTCTT CACCGCAGG A GAAGATACAA 2400 
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATG AACA 2520 
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 
TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATG AAAT ACAAAACCAC 2700 
G AACCAATCA ACAG AATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 
AG AAAAAGAA AGGG AACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAG AAGTCC TGG AGCAGCA 3000 
AACATAGATG GAG AGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTGATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 



Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

I 11 21 31 41 51 
1111(1 

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMUNCEA KGIKMVSEIS 60 
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNl ADIEIGAFNG U3LLKQLHIN 1 20 
HNSLEILKED TFHGLENLEF LQADNNFITV lEPSAFSKLN RUCVLILNDN AlESLPPNIF 180 
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQUC TWLENMPPQS 240 
HGDWCNSP'PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 
TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLUHCQER NIESLSDLRP 360 
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PPQNPRKLIL AGNIIHSLMIC SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 
NHLTKLSKGM FLGLHNLEYL YLEYNADCEI LPGTFNPMPK UCVLYLNNNL LQVLPPHIFS 4S0 
GVPLTK>n«JC TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 
5 LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 



Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence: 307-3036 (underlined sequences conespond to start and stop codons) 

15 

1 11 il 31 41 51 
) I I I i 1 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 120 

20 CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACO ATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCXTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

25 GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTG ATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 
^ TGTTACTACC ATGG ACATGT ACGGGG ATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720 

30 TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATG AAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 

3 5 CAGCGATTAA TAGAG ATTGC TAATCACGTT G ACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1 140 
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200 
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
. ^ GCCCCAATCA TGAGCATGTG CACGGCAG AC CAGTCTGGGG G AATTGTCAT GG ACCATTCA 1 320 

40 GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440 
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 
. AGCGAGTCTT TCGGGCGCCA GAAGTGTCGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620 

45 GACTGTGGGG AGCCAGAGG A ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

^ GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 

50 CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAG AGT TCCTTTGCCA AATGCGAG AT GAG AG ATGCT 2040 
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTGTGTG CCGGGGGACC 2160 
_ CACGTGTACT TOGGCG ATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 

55 GCAGATGGAA AAATCTGCCT G AATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTG AC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520 

60 TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 
CGTGGCTTCC AACCXTTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATG AGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGG AG AT TGCTGCAGTG TCAG A ATGTT 2700 
G ACATCAGCA G ACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCG AGTG 2760 
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 

65 AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAG A CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACXZAA 3000 
GTOCCCAGAT CCACCCACAC CGCCTATATT AAGJCAGAAG CCG ACACCTT TTTTCAACAG 3060 
^ TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 

70 GG ATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTG AGCTAC TGCCGTCAGT 3 1 80 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGT AAATT ATTAATTTAT 3240 
GCAGAATGTT GATTACAGTG CAGTGCQCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTG CI 1 1 i A G TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGG ATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

75 CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 
TGTGTrrGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600 
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ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTG AG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3780 
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCXrATGGG ACCACAACTA TTTTCAGATG 4020 
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACTTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT G AG ATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 
AATGGCATGA GAAATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500 
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 
TTTATG AGA A AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGAATCTTGGTTTGCCTTCCAGAAAACAAAACTGCATTTCACTTrCCCGGTGT 4680 
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 



Sep ID NO: 10 Protein sequence: 
Protein Accession NP_003465.2 

1 11 21 31 41 51 
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MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 
SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 
YHGHVRG YSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK ICLKS VRGSCG 1 80 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 
LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420 
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 
lETNlPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 
LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 



Seq ID NO: 1 1 DNA sequence 

Nucleic Acid Accession NM_001394 

Coding sequence: 400-lS84(undeiiined sequences conespond to stait and stop codons) 

I 11 21 31 41 51 
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GGCACQAGGG CTGAGCGCCG GAGGAGCX5TA GGCAGGGCAG CGCTGGCGCX: AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACX3GG AGGCCGTCGC CQAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 1 80 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGGACA 240 
GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTrC 360 
TCTTCCTCGC GCGCCCAGCC GCCTCGG1TC CCGGCGACCATGGTGACGAT GGAGGAGCTG 420 
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGG ACGAG AA TGGCGGCGGC 480 
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 
CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600 
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCG AGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720 
TACGACGAGC GCAGCCX:GCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780 
GTGCAGGCGC TGCGCCGCAA CGCCG AGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 
G AGAGGTTTT CCTCCGAGTA CCCAG AATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900 
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080 
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1 140 
GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCX:ACCATCT GCCTGGCCTA CCTGATQATG AAGAAACX3GG TGAQGCTGGA GGAGGCCTTC 1320 
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G AGTTCOTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TC AGCTTCAT GGGGCAGCTG 1 380 
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCXTTOTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GGACCCCTGCGGGAGCGGGGCAAGACCCCCGCCACCCCCACCTCGCAGTTCGTCTTCAGC 1500 
. TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
5 ATCACCACCT CTCCCAGCTG TTAGAGCCGC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1 620 

CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680 
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CX3AGGCCAGC CAGAATGGCA 1740 
ATAAGG ACTC CGAATACATA ATAAAAGCAA ACAG AACACT CCAACTTAGA GCAATAACCX3 1 800 
GTGCCGCAGC AGCCAGGG AA G ACCTTGGTT TGCTTT ATGT GTCAGTTTCA CTTTTCCGAT 1 860 

1 0 AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920 

CTAGCAAATG TGCCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAGA AGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040 
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTT TTCA TTTCTCAGTC 2100 
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 

1 5 TGAAAGTACA AA AAGTCTAG TTACAGTGG A TTTAG AATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC AC TGAGA AGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG d i lTlAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAA GCAG GCGGGTTGGG 2460 

20 TGGGAGGAGG AAGAAAGGGA AG AATTAGGT TTG AATTGCT TTTTTAAAAA AAAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession #: NP_001385 

25 I n 21 31 41 51 

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120 
DSTVSLWQA LRRMAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 1 80 
30 LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 

QYKCIPVEDN HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSATl CLAYLMMKKR 300 
VRLEEAFEF V KQRRSUSPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

35 Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 68«340(underlined sequences correspond to start and stop codons) 



40 1 11 21 31 41 51 

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120 
TGCCCAG AAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCXTGCTGA 1 80 

45 TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACC A CTGC TGCTCC 240 
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCIGA G ATGGAATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACG ATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 

50 GCGAGCTAAC AT 



55 



65 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: none found 



1 n 21 31 41 51 

MKFLAVLVLL G VSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60 
60 ' ATTAASTTARKDIPVLPKWVGDLPNGRVCP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM_01 6640.2 

Coding sequence: 39-1 358(underUned sequences correspond to start and stop codons) 



I n 21 31 41 51 

GCTTAAGTTG ACCTCTGGGT CCGG AATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60 
70 GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGG ATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360 
75 ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCGTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACG AGG 540 
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AGAGCG AGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCX: 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTG AA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720 
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780 
CAG AGTTTGT GCCATTGG AT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840 
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAA AAA 900 
CTGCAG ATCC TTGCTGTTAC GGTCACACC€ AGTTTCATCT GTTACCTGAC AAATTAAG AA 960 
GGGAAAGGCT TTTG AG ACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
A AGCAGA TGT TACTCGACCT TTTGTCTCCC AGGCTGTG AT CACAGATGGA AAATACTTTT 1 140 
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260 
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAA CTGA AA AAGCATATTT GATTGAGAAC 1380 
TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATGAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1500 
ACTCTGCTCA AATTC ATC AC TG AAAG ATTT AATTTTAGTT ACCTTTTGTT G ATTTAAAAA 1 560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



Seq ID NO: 1 6 Protein sequence: 
Protein Accession #: NP_057724. 1 

1 11 21 31 41 51 

I I I I I 1 

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RJUER WQAT V HA AES VDEKL RILTKMQFMK YM VYPQTFAL NADRWYQYFT KT VFLSG LPP 120 
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 1 80 
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQlDDKP^fNQ 240 
IRISKQLAEF VPLDYSVPIE EPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300 
LLPDKLRRER LLRQNCADQI E WFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360 
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420 
IVHFLLNRPK EEKSQLLEN 

Seq ID NO: 1 7 DNA sequence 

Nucleic Acid Accession #: NMJ025059.1 

Coding sequence: 3-2 1 SO (underlined sequences correspond to start and stop codons) 



I n 21 31 41 51 

] i I I I t 

GCATG AGCCT GGACTGCACC AQCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCG A AGTGAACTTG CAGCAACTTT GGTCAAATTT G AATGTGCTC 1 80 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTGA AATGGAG AGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CriTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480 
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540 
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAA AG A AAATG AATTC GTG AA AGG AC AAATTGTTAT TCTTG A AG AG ACTATAAATG 660 
TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACG ATCAT G AGGCTGGCT TCAGAAGTCA 720 
ACAGAG AGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAG AA AGAGAAGCTG AACCAGGACC 780 
TGCTCAGTGC TGTAG AAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840 
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTG AAGA 900 
A AAGCTCTTC TG AGTTGG AG AAG AGTTTG A AGGCCAGTCA GG ATGCAGTC ACAACCTCAC 960 
AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020 
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1 140 
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260 
TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320 
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC OCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500 
AAG AATTACA CATGAGCCTC CTCCGGCAG A AAATAGCCCA GCTGGAGGAG G AG AAGCAGG 1 560 
CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620 
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACr GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740 
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860 
AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TG AAATCATC AAGTGTCTTG 2040 
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AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAQATGTGA 2100 
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTGA ACACTGTATC 2160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220 
TCTTTGAGATTTGATCAGTTTGTGAATATTTTATGCTTTG ATGATATAGT QAGAATGCAT 2280 
5 CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAG ATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGG AT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400 
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGGAGGTAT CTATTTTAAG TCAGGGGCTT 2460 
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAG AAG CTGTG ACATT TTTAATTTAC 2520 
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 

1 0 GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AGAGGAAGGA 2700 
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 

1 5 GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAG AAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3120 
AACTTACATA GGTCTCAGGT CCTGTAAG AA ACTTGCCTGT TCTAACTGTT GCTACCAG AT 3 1 80 

20 TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAG AAATCC TATATAGTCT TATTCACCAA 3300 
TATATCCAAT ACACCCACAG CAATGGTACC TTTTTAAGAT CAGGATTTTA TTATG AATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAOG G AATCATATT 3420 
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480 

25 TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540 
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAA T^.TAC AACCAACCGT 3600 
TCGTGAGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAA AjTTC CGTCTCTGCA 3660 
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGG AGGT TGCAATGAGC CGAGATCGCG 3780 

30 CTACTGCACT CCAGCCTGGG CAACAG AGAG AGACCTTGTC TCAAAAAAGA ACAACAACAA 3840 
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900 
AGAGATGAAA CAATCAATTG CAAGGGTCAA a ACTAGATTG GATCTTGGTT TGAATGAAAA 3960 
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA G TCCT AGATT 4020 
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 

35 AACTAAAAAT CTGAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140 
AAAAAAAAAA A 



Seq ID NO: 1 8 Profein sequence: 
40 Protein Accession NP_079335.l 

1 n 21 31 41 51 

I I I I 1 1 

MSLDCTSHIA LGAASPAPEE TVDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60 
45 SELQDLRSKM LSKE VSCQEL KAEMES YKEN N ARKSSLLTS LRDRVQELEE ESAALSTSKI 1 20 
RTEITAHAAI KENQELKKK V VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 1 80 
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEICLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 360 
5 0 SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGG V 420 
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRJLDV VLARTEQLVR LESNAVIENK 480 
TIAHNLQRKL KTQKERLESK ELHMSLLRQK lAQLEEEfCQA RTALVVERDN AHLTIRNLQK 540 
KVERLQKELNTCRDLHTElXAKI^TNELKIKTLEQTKAIEDLhKSRDQLEKMKEKAEKK 600 
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREWSQM 660 
5 5 LGLNVTSLAL PDYEIHCCLE RLVHSHQHHF VTCACUCDVT TGQERHPQGH LQLLH 



Seq ID NO: 19 DNA sequence 
^ _ Nucleic Acid Accession #: AF07 1552. NM_000662 

60 Coding sequence: 441-13 1 3 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

65 CTTTGTATAA GGCTCAGCTa! AAAGGGAAAT TGAGTGGGTC AGGTACCACG G ATACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AG AAG ACGTT TATAAGCCTA TTTTAAAGGA 1 20 
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAG AG AAC CATG AACAAG CTGTTTATCA 1 80 
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAG AAG acataataca ::4o 
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300 

70 TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTA AAATA 360 
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC nrCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATGG ACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTG ACAT TCTTCAACAC CAGATCCGAG 540 
CTGTTCCCrrTGAGAACCTTAACATCCATTGTGGGGATGCCATGGACTTAGGCTTAGAGG 600 

75 CC Ai U I IG A TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTCACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720 
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATG ATTCA CCTTCTCCTG CAGGTGACCA 780 
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TTGATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840 
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900 
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960 
AATTTCTTCATTCTGATCrCCTAGAAGACAGCAAATACCG AAAAATCTACTCCTTTACT^ 1020 
5 TTA AGCCTCG AACA ATTG AA GATTTTGAGT CTATG AATAC ATACCTGCAG ACATCTCCAT 1 080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CXTCAGATGGG GTTCACTGTT 1 140 
TGGTGGGCTT CACCCTCACG CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGACTCTGAGTGAGGAAGAAATAG AAAAAGTGCTGAAAAATATATTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAG ATT TTTTACTATT TAGA ATAAGG 1320 

1 0 AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1 380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA GC UMT AAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 

J ^ TGGAAATTGG TGATTTATGC TAG A AAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Scq ID NO: 20 Protein scQuence: 
Protein Accession U: NP_000653 . 1 



20 1 II 21 31 41 51 

) I I I I I 

MDIEAYLERI G YKKSRNKLD LETLTDILQH QIRAVPFEhO. NIHCGDAMDL GLEAIFDQVV 60 
RRNRGG WCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 1 20 
_ ^ VDAGFGRSYQ M WQPLEUSG KDQPQVPCVF RLTEENGFWY LDQIRREQYI PNEEFLHSDL 1 80 
25 L£DSKYRKIYSFTLKPRTIEDFESMNT^LQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLT 240 
HRRFNYKDNT DUEFKTLSE EEIEKVLKN! FNISLQRKLV PKHGDRFFTI 

Seq ID NO: 21 ON A sequence 
' Nucleic Acid Accession NM_003714 
3 0 Coding sequence: 1 23- 1 03 1 (underlined sequences correspond to start and stop codons) 



1 11-21 31 41 51 

I I r I I I 

3 5 CGGCACG AGO AAAAAGGAAG AGTGGGAGG A GGAGGGGAAG CGGCG AAGG A GG AAG AGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120 
CC ATGT GTGC CG AGCGGCTG GGCCAGTTCA TG ACCCTGGC TTTGGTGTTG GCCACCTTTG 1 80 
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 

^ _ AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300 

40 CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360 
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 ' 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540 
. ACCTCAAGCA CG ACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGG AG ATGA 600 

45 TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660 

CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCXTACAGCGT GCAGGTTCAG TGTGAGCAGA 720 
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780 
CGGCGCCCCC CGAGGGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
OGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900 

50 GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960 
GGGCTCAGGG ACCTTCCGGA AGCAGGGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020 
TCCGGAGGTGAAATGAAAGG CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080 
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1 140 
GTGGGGACTG TGGACTTCAT CGAGGTOTGT GTTCOCGGAA CGGACAGOTG AGATGG AG AC 1 200 

55 CCCTGGGGCCGtGGGGTCTCAGGGGTGCCTGGTGAATTCTGCACTTACACGTACTCAAGG 1260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGG AG TCAGTGGGTG 1 320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGGAAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 

60 AAATATCGCT TAGAATGCAG GAG AAGGGTG GAG AGGAGGC AGGGGCCG AG GGGGTGCTTG 1 560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAO AGCT CTTGGCG AGG 1 620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTG TACTG TCTGGGCTGG 1 680 
GGGGGACACT GTCCAAGGGA GTGGCCCCTA TOAGTTTATA TTTTAACCAC TGCTTCAAAT 1 740 
CTCG ATTTC A CTTTTTTTAT TTATCC AGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1 800 

65 GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein scQuence: 
Protein Accession #: NPJ)03705 

70 

1 11 21 31 41 51 
I i I I i I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQKT AEIQHCLVNA 60 
_ _ GDVGCG VFEC FENNSCEJRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 1 20 
75 ISRKCPAIRE MVSQLQRECY UCHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 1 80 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
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Scq ID NO: 23 DNA sequence 
5 Nucleic Acid Accession #: NM_005264. 1 

Coding sequence: 5S7-1954(underlined sequences coirespond to start and stop codons) 

1 11 21 31 41 51 

10 1 I 1 I I 1 

GAATTCCGGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCGAGC CGAGGGCTCT GCTCGGAA AT CGTCCTGGCC 120 
CAACTCGGCC CTTCG AGCTC TCG AAGATTA CCGCATCTAT TTTTTTTTTC I H ITl 1 ICT 1 80 
TTTCCTAGCG CAGATAAAGT G AGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 
1 5 CTGAAAG AAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGG AGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGCACAGC TGTCCGGGGA TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGGATTT TTTTGGGGGG GCGGGGACCA 540 
20 GCarcaCGCC GGCAC CATGT TCCTGGCGAC CCTGTACTTC GCXSCTGCCGC TCTTGGACTT 600 

GCTCCTGTCG GCCGAAGTGA GCGGCGGAGA CCGCCTGGAT TGCGTGAAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTG AAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 
25 GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGO GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGAtGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCC AACGA TGTCTGC AAC CGCCGCA AGT GCCACAAGGC 1 1 40 
30 CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1 200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCGACAG ACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CX3AATTACAT 1320 
CTGCAGATCT CGCCTTGCGG ATTTTTTTAC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 
3 5 AGTCATGACC CCCAACTACA TAG ACTCGAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACG ACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 
40 TGCAAATTTA CAGGCACAG A AGCTGAAATC CAATGTGTCX} GGCAATACAC ACCTCTGTAT 1 800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAG AAACATC ATAGC TGCAT TAAAAAAATA CAATATGG AC 1980 
ATGTAAAAAG ACAAAAACCA AGTTATCTGT TTCCTGTTCT CTT GTATAGC TGAAATTCCA 2040. 
45 GTTTAGG AGC TCAGTTGAG A AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2 100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATG AT GATG ATCATC ATGATCATG A TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
50 TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTGCC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTG A TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA G AAAAAACCA CCCGGAATTC 

55 Seq ID NO; 24 Protein sequence:. 

Protein Accession NP_005255 

1 n 21 31 41 51 

I I I 1 I I 

60 MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 

SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKXEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFR WPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 1 80 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 

65 YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATrrrALRVKNia>LGPAGSENEnn-HVU»PCANLQAQKLKSNVSGNTHLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL WTALSTLLS LTETS 

Scq ID NO: 25 DNA sequence 
70 Nucleic Acid Accession FGENESH predicted 

Coding sequence: I576(cntire sequence represents open reading frame) 

1 II 21 31 41 SI 
„^ I I I I 1 I 

75 CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 

TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGG AGC CCCCCGCCCC 1 80 
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CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCG A CG ACGGCCCC 300 
CGCAGCXjCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
5 TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTG AGO GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCAT CTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 

10 GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CXCXTAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCG AGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 

1 5 GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1 140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
G AACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 

20 GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTG ACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1 500 
CATCTGCGAG AGG ACGGACC CAGAGTGTCC CX3TCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 
CCGAATATTCTCCTAA 

25 

Seq ID NO: 26 Protein sequence: 

Protein Accession »: FGENESH predicted 



30 1 11 21 31 41 51 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 1 80 

3 5 FM VTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFG V APALPGQVTI RVRVPYRWG 240 
LWGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLCaCECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS.SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSFCU3 GGGLRSPGGG RDCMVCFESE 480 

40 VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIPS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession ff: FGENESH predicted 

Coding sequence: 1 -2070 (underlined sequences correspond to start and stop codons) 

45 

1 11 21 31 41 51 

1 I I I I I 

ATG AGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCXjCCC CTCTGTGGGC GTCCAGTCCT TG AGGCCXjCA GAGCCCGCAG 120 

50 CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATG AGG AGATCG AGCA TCTG AAGCGG 240 
GAAAACAAGG GTG AGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAG AAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 

5 5 CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGG ACCT GGACAGATGC CGCTACCTCT 480 
AGCCQTGGCT GGACG ATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAQCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCC CTCC T 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 

60 ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGGACAAA GGGAGG AAGC AGAGTCGTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG GAGG ACATCT GACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 

65 GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1 140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGG GACC T GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 

70 AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGGACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 

75 G AGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1 680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTOC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCG A AGTGCTCATC CGCG AGCTGT GGAATACC A A CCTCCTGC AG 1 800 
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ACCCAAG AGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1 860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCXTACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
5 AAACGGCGCC TGCATCGCTC AGTGCT TTGA 

Seq ID NO: 28 Protein sequence: 

Protein Accession FGENESH predicted 

10 

1 II 21 31 41 51 
111)11 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AnsiSSTRLGS 120 

15 GGTQDGEPLQTVLAHIJVAU^PVCQPSGYRFWGTWTDAATSSRGWTMLCSQAQHVLLSGSP 180 
GPEVIAGRQV ATGCSPDLPPPSRAEMGRNPWDSPCPARSLPQiAAVARPRISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 

20 GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGAACMGNSQHQGRQMGAGAHPPMILPLPLRKPTTLRQCEVLIRELWNTNU.Q 600 
TQELRHLK.SL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 

2^ LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_0123 1 9.2 

Coding sequence: 1 38-2405 (underlined sequences correspond to 5'- :r. 4nd stop codons) 

30 

1 n 21 31 41 51 

I ) I 1 I ! 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGG AAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CaaCG AGGCC 120 
3 5 GCGGAGACG A AGGCGCAATG GCGAGG AAGT TATCTGTAAT CTTG ATCCTG ACXTTTGCCC 1 80 
TCTCTGTCAC AAATCCCCTT CATGAACTAA A AGCAGCTGC TTTCCCCCAG ACCACTGAG A 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCnTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
40 ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAG A AAC AGC CAGGGGAAAG G AGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

45 TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 
TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TG AAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
^ GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

50 TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1 140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGT TTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCl 1 1 1 1' 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAG AACCAG 1 320 

5 5 CAATGG AAAT G AA AAG AGG A CCACTTTTCA GTC ATCTGTC TTCTCAAA AC ATAG AAG AAA 1 380 
GTGCCTATTT TGATTGCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAGAAACC TGAAAATGAT GATG ATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTTTC A ACAAATG AG G AGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1 620 

60 GAGCAGACTC ACAAG AGCCC TCCC ACTTTG ATTCTCAGC A GCCTGC AGTC TTGG AAG A AG 1 680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGG AAGTCTA CAATG A ATAT GTACCCAG AG 1 740 
GGTGCAAGA A TAAATGCC AT TC AC ATTTCC ACG ATACACT CGGCC AGTC A G ACG ATCTCA 1 800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC I860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

65 TGGCCTGG AT GGTGATAATG GGTG ATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1 980 
GTGCTGCTTT TACTG AAGGC TTATCA AGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATGAATTA GGTG ACTTTG CTG7TCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 

70 GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATG ATGCTA 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCl l l'lGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCXjTGTTT CGTATAAATT 2400 
TC TAGT TAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT- 2460 
AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT rrTGTGATTT 2520 

75 TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGGAG ATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 
TAAACAAGAG AnTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
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TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAG AAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2940 
D CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
i U CAAA ATTATC AG AGTAGTAA AACTTTGATA TATATGAGG A TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATT CAGAA AGTACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
G AGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACA A TAAATTCCTT ATATCAGCTT G 

15 Seq ID NO; 30 Protein sequence: 

Protein Accession #: NPJ)3645 1 .2 

1 II 21 31 41 51 

1 I I I I I 

ZKJ MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

FYRYGENNSL S VEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 1 20 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 1 80 
SVSASEVTST VYNTVSEGTH FLE'llETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 

Zd RSCUHTSEK KAEIPPKTYS LQIAWVGGFl AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLDCQFKDK KKKNQKKPEN DDDVEDCKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVl 600 

JU MGIX3LHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHEU»HE LGDFAVLUCA GMTVKQAV^^ 660 
NALSAMIAYLGMATGIFIGHYAEWSMWIFALTAGUfMYVALVDMVPEMLHNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 
3 5 Nucleic Acid Accession #: NM_002 1 84. 1 

Coding sequence: 256-301 2(under!ined sequences correspond to start and stop codons) 

I II 21 31 41 51 
I I 1 I I I 

4U GAGCAGCCAA AAGGCCCGCG G AGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCG ACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 1 20 
AGACGCGG AG GGTCG AGGCG GCGCGGCCTG AGTG AAACCC AATGGAAAAA GCATGACATT 1 80 
TAGAAGTAGA AGAC1TAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTG ATTTGG 240 
AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 

45 CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
GATTATTTTC ATGTAAATGCTAATTACATTGTCTGGAAAACAAACCATTTTACTATTCCT 480 
AAGGAGCAAt ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTA AATA TTCAGCTCAC TTGCAACATT CTTACATTCX3 GACAGCTTGA ACAGAATGTT 600 

D U T ATGGA ATC A CAATAATTTC AGGCTTGCXTT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 

GTGAACGAGG GGAAGAAAAT G AGGTGTGAG TGGGATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAAATC TG AATGGGCA ACACACAAGT TTGCTGATTG CAAAGCAAAA 780 
CGTG ACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAG AGAA TGCCXTTTOGG AAGOTTACAT CAGATCATAT CAATTTTGAT 900 

O 3 CXrTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATATAG GACCAAAG AT GCCTCAACTT GG AGCC AG AT TCCTCCTGAA 1 080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1 140 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 

OU GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
A TAGAT CCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCi iTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCtCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTAACAGT AAGAA ATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1 500 

65 ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1 560 
GATAACATGC TTTGGGTGG A ATGG ACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1 620 
GAGTGGTGTG TGTTATCAG A TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1 680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTT ACTCCAG TATATGCTG A TGG ACCAGG A AGCCCTG AAT CCATAAAGGC ATACCTTA AA 1 800 

70 CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
GTC TTAG AGT GGGACCA ACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1 920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
G AATATACAT TGTCCTCTTT G ACTAGTGAC ACATTGTACA TGGTACG AAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGGA TGGTCCAGAA TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 

75 GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GGAGTGCTGT TCTGCTTTAA TAAGCGAGAC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACICXTTCC AAGGCACAAT 2280 
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TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTGAT GAAAATGAAT CTTCACAAAA CACTTCGAGC 2520 
5 ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAG AC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA OATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCXj GCCAGAAGAT. 2640 
CTACAATTAG TAOATCATGT AGATGGCGGT GATGGTATTT TGCCCAOGCA ACAOTACTTC 2700 
AAACAG AACT GCAGTCAGCA TGAATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATG A GGAAG ATTTT GTTAG ACTTA AACAGCAGAT TTCAG ATCAT 2820 
1 0 ATTTCACAAT CCTGTGG ATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACAAGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAG A CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAG T GAA GG ACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGATTTTATCTGTG AATTC 

Scq ID NO: 32 Protein sequence: 
Protein Accession #: NP_002 1 75. 1 

_ 1 II 21 31 41 51 

20 I ) i I I I 

MLTLQTWWQ ALFIFLTTES TGELLDPCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEW ATHKFA DCKAKRDTPT 1 80 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPWPHNLSV INSEELSSIL 240 

25 ICLTWTNPSIKSVnLKYNlQYRTKDASTWSQIPPEDTASTRSSFTVQDLKPFTEYVFRIR 300 

CMKEDGKGYW SDWSEEASGl TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 

30 QLPVDVQNGF IRNYTIFYRT UGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 . 
KDGPEFTFTT PKFAQGEIEA FS^PVCLAFL LTTLLGVLFC FNKRDLDCKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KKPFPEDLKS LDLFKKEKIN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TWHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFK.QNCS QHESSPDISH FERSKQVSSV 840 

3 5 NEEDF VRUCQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Scq ID NO: 33 DNA sequence 
Nucleic Acid Accession #: NM_01 8255. 1 

40 Coding sequence: 1 .1 -249 1 (underlined sequences correspond to start and stop codons) 

1 n 21 31 41 51 

] I I I I I ' 

AGTTGGCG AC ATG GTGGCAC CCGTGCTGG A G ACTTCTCAC GTGTTTTQCT GCCCAAACCG 60 

45 GGTGCGGGGA GTCCTGAACT GG AGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTGCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCA ACT TGAATGGTCA 1 80 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTA AACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 

_ _ TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 

5 0 CCAGAGGAGG ACATCAG ATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGC AGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGTGGCAATGATG ATTGCAGAATTCACATATTTGCTCAACAAA ATGATCAGTT 600 
TCAGAAAGTG CTTTCTCTCT GTGGACATGA GGATTGG ATT AGAGGAGTGG AATGGGCAGC 660 

5 5 CTTTGGTAG A G ATCTTTTCC TAGC AAGCTG TTC AC AAG AT TGCCTG ATAA G AATATGG AA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC GATAACATAA GACTGAAAGA 780 
AAATACTTTT ACCATAGAAA ATG AAAGTGT TAAAATAGCA TTTGCTGTTA CT CTGGA GAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
^ AGATGGTGTC CTACAGCAGC CAGTG AGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 

60, TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGG AAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1 080 
TGCTCATGCTTTCCACGGAG CGTTGCACCT TTGG AAACAG AATACAGTTA ACCCAAGAGA 1 140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAGAAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAGAC TTTTTGCTCC 1260 

65 ATGGAACAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCA ATG AT TAATCGGTTT CAGTTTGTAT CTGG AGCAG A 1 380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 

70 GCCTTCTG AT G A AGAGGAGC TGTTAACTAG TACTGGTTrT G AGTATCAGC AGGTGGCCTT 1 620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGG ATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGAAGTT CAAAAACTAT ATGGGCACGG TTATGAAATA TTTTGTGTTA CTTGTAACAG 1740 
TTCAAAGACTCTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGG AAC ACTACATCTT GG AAACAGGT GCAG AATTTA GTTTTCCACA GTTTG ACAGT 1 860 

75 CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGarrTCACCAACAAAATrACTTCTGTGCACAGTAGA ATTATTTGGTCTTGTGATTG 2040 
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GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGQTT GCAGTAGGAT TGG AGTGTGG AAAGATTTGC TTATATACCT GG AAAA AG AC 2280 
5 TGATCAAGTT CTAGAAATAA ATGACTGG AC CCACTGTGTA GAAACAAGTC AAAGCCA AAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG G AAG AATTGC AGTGGAAAAA CTGAACAGAA 2400 
GGAAGCAGAA GGTGCTGAGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
G ATACACAGA GTCAATAAAT GTGCACT GTA AT GG 

10 Seq ID NO: 34 Protein sequence: 

Protein Accession #: NP_060725. 1 

I 11 21 31 41 51 
I I i I I I 

1 D M VAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 

VNCIQWICKQ DGSPSTELVS GGSDNQVIHW EIEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 1 80 
GNDDCRIHIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
KSTSLETQDD DNIRLICENTF TIENESVKIA FAVTLETVLA GHENfWVNAVH WQPVFYKDGV 300 

20 LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMIIAHA 360 

FHG ALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFIITVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE lARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAHLWN 600 

25 TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 

TNKITSVHSR IIWSCDWSPD SKYFFTGSRD KKVWWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRiCLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

30 Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_022131 

Coding sequence: 1 1 -2878 (underlined sequences conespond to start and stop codons) 

I 11 21 31 41 51 

35 I I ! t 1 I 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 1 80 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCmTGC 240 

40 AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAGAGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
GAAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 

' TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTGACG G AGGGCAAGA TCTATGACAG 540 

45 CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATGAAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTG AGCTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAGAG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 

50 GCCCCTGTTC CCCAGCATCC ACCTGGAGAC GTGCGATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAG ACTA ATTACATTGG GAAGGGTTGT GACCGGGAGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTGCAGGACT GCTGGTGGAC AGCAGTGAGA TG ATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1 140 

55 TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGAGAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGG A TCAG ATTTGT G ACAAAGAGT GGCACTACTA 1 380 
TGTCATCAAT GTGGAGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 

60 ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC AACTCACAGT 1500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTC ACCA TCCGCCCTGG CAAAATGG AA AGCCAG AAGG TG ATCTCCTG 1 620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
A AAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG G AAGGTGACG ACATTGGG AA 1 740 

65 CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1 800 
TGTGCGGCGC CTCAA AGTAT CCTCCAAAGT CCAGTGCTTT GGGG AAG ACG TATGCATCAG 1 860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GGAGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCTGATATCAAGATTGTGAGCACCTTCGCCAAA ACCGAAGCCCCCGGGGACGT 2040 

70 GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATnTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGG ATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATOCGCT ACCGCAACTG 2280 

_ _ GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 

75 CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATG AAGACC AAGTCTCAG A 2400 
TAAGGAGCAT GTCAATCATC TGATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
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CATCATCTCC GTGTGCATGCTTGTGTTTGTCGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CX3CCCACCAG CACTTCATCC AGGAGACTG A GGCTGCCAAG GAATCTG AGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AG AAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
5 TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGGA GCCAGGCAAG CCCAGCTGGA GTGGGATG AC TCX:ACCCTCC CCTACTAGTG 2880 
CCXAGGGGTC TGCTGCCTGG CCCACATGTC CXTI 1 ITGTAA ACCCTGACCC AGTGTATGCX; 2940 
CATGTCTATC ATACCTCACC TCTGATGTCT GTGACATGTC TGGG AAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 

1 0 ' A AGAAGCCCA GCATGGCCAT CAGTG AGG AC TTCAGGGTAG ACTTTGTCCT GTA GCCTCC A 3 120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTGACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 

1 5 TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAG AGCCTTT 3480 
CCCCTTGCTT CTTTCTG AGG CCATATAAGC TTATAAGAAA AGTGCCAAAC CAAGAATAGG 3540 
TCCTTGGCCA CAAGC AGGGT CTG ATCCCCC ATC AGAGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 

20 CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAG AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCTAT CTTCCCTCTT GAG AAAATAC ACGCTTTCTG CATGTATTAG 3960 

25 AAACGCACGA GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGGAATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
. TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTCCATCCTAACATGCACAACCTGTGAAGAGAATTGTTTCTATAGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTGA CAGAATTTCC 4260 

30 ACGAAGCTCT GAGAACATGT TTGTTTCGAA TGTCTG ATTC CTCTTTGTCA TCA ATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AG AGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAA AAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAA A CAGCTTAAG A AGGGG ACTAC TGCCA ATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 

3 5 CTTGTG AAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 

CACAACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Scq ID NO; 36 Protein sequence: 
40 Protein Accession #: NPJ07I414.1 

I 11 21 31 41 51 

MLPGRLCWWLLLALGVGSGSGGGGDSRQRRIXAAKVNKHKPWICT 60 
45 LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEA VVLNKT SGEGRLRAKS PIDCELQKEY 1 20 
TFUQAYDCG AGPHETAWKK SHKA WHIQ V KD VNEFAPTF KEP A YKA V VT EGKIYDSILQ 1 80 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
5 0 RQG AKIPDGI VPKNLTDQFT ITMWMKHGPS PG VRAEKETI LCNSDKTEMN RHHY ALYVHN 420 
CRLVFLLRKDFDQADTFRPAEFHWIO-DQlCDKEWHYYVINyEFPVVTLYMDGAT^ 480 
TNDWPIHPSH lAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLTIRPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGIKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVCISIPE VDAYVMVLQA lEPRITLRGT DHFWRPAAQF ESARGVTLFP 660 
5 5 DIKIVSTFAK TEAPGDVKTT DPKSEVLEEM LH^a.DFCDIL VIGGDLDPRQ ECLELNHSEL 720 

HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRIK CSELNGRYTS 780 
KEFNLEVSILHEDQVSDKEHV>mLIVQPPFLQSVHHPESRSSIQHSSVWSIATVVinS 840 
VCMLVFWAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTnVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

60 

Scq ID NO: 37 DNA sequence 
Nucleic Acid Accession U: none found 

- Coding sequence: 143-874 (luiderlined sequences correspond to start and stop codons) 

65 1 11 21 31 41 51 

GGG AGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTG ATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

70 CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA CCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGG AG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTG AGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

75 ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGG AGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 
CTOTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
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AGCTATAATT TATTTGGACC AAGGAAGCXC TGAAATG AAT TCAACAATTA ATATTCATCG 720 
CACTTCTTCT GTGG A AGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGG AG ATGCT TCTACTGGAT GGAATTCAGT 840 
TTCTCGCATC ATTATTGAAG AACTACCAAA ATAA ATGCTT TAATTTTCAT TTGCTACCTC 900 
niilllATT ATGCCTTGGA ATGGTTCACT TAAATG ACAT TTTAAATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAG AAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1 1 40 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAA AA ATATA AA AGC TACC AATCTT 1 200 
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTeCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 38 Protein seouence: 
Protein Accession #: none found 

1 11 21 31 41 51 

1 I i ! i I 

MRPQGPAASP QRLRG LLLLL LLQLPAPSS A SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNG AECSGPL PIEAIIYLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEB 240 
LPK 

Seq ID NO: 39 DNA sequence 

Nucleic Acid Accession #: NM_000949 

Coding sequence: 285-2 1 53(underlined sequences coirespond to start and stop codons) 

1 11 21 31 41 51 
I I i I I I 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCXrrCTTT CTGG ATTTTA CCGACCGTTC GCG AAACAGC TTTCCACACA ATGG AGCTTC 1 20 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAGAACTC TCCTATTCAT GG AGGCGAAC ACTG AGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGGCAGC CAACATG AAG GAAAATGTGG 300 
CATCTGCA AC CGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGGAGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGGAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGG ACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAGACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAGAC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGGTTCACGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AGAGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TTCATTCAGA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTAIT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGOATT.TGATGCTCAT CTGTTGGAGA 1 140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGOATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTGAGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
AC ACTG ACTG AGGCGGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAGAGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1500 
TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1 560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTG AAT G AAGCAGGTA AAGATGCTTT AAA ATCCTCT CAAACCATTA 1 680 
AGTCTAGAGA AGAGGGAAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTGACCAGGA TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTGGC TCCGCTAAAC 1 800 
CCTTGG ATTA TGTGG AG ATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1 860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT GAAGAATCAG CCAAAGAGGC CCCACCATCA CTTG AACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG GATCCCGCAT GTTTTACACA CTCCTTTCAC IGATAGCTTG 2 160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA G ACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGC AG CTG ATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATTTGTAGATTTA CTl ITl'GCTG TTAGTTATAA AACTATGTGT TC AATGAAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTGAGGG A CAATGCCAAT AGGTATATCC TCTGGA AAAG 2460 
GCTTTCATGA TTTGGCATGG G ACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAAAT ATAACCATTT 2640 
AAAG AAAGAA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTGATAT GCAAGTAAGA AAT 



Seq ID NO: 40 Protein sequence: 



364 



wo 02/059377 



Protein Accession #: NP_000940, 1 

1 11 21 31 41 51 
I 1 I I 1 I 

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGFNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWDC WSPPTUDUC TGWFTLLYEI RLKPEKAAEW 1 80 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 , 
VAVLSAVICL UVWAVALKG YSMVTCIFPP VPGPKIKGFD AHLLEKGKSE ELLSALGCQD . 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKIPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HMTDVCELA VGPAGAPATL l.Nf AGKDAUC SSQTKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV BiHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEK ALANFTATSS 600 
KCRLQLGGLD YLDPACFTHS FH 

Seq ID NOt 41 DNA sequence 

Nucleic Acid Accession 0: none found, Eos cloned sequence 

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGA CCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GG AGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTG AAGTCTT CACCTACGGC 1 20 
ATCATCAAGA CATTTGQTGT CTTCTTTAAT GACTTAATGG ACAGT TTTA A TGAATCCAAT 180 
AGCAGG ATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCXICCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
AtCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGGAGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCQG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCGTCA CCG AAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTGA AAATGAGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAGAAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAGACTTCTC CATTTTGAAA GAGAAAAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA CCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAG AA 1020 
GTTTTCGGAA GGATCGGAGC TGGTTTTGTC CTCAACAGGG AGCCCATTCG TAAGATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGGACTCACA TTCCACTGCT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGGTCTACATCTTCATTCAGAGCATAGCAGGACTGGCTGGACCGCX:CCTTGCA 1320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CXjCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG aGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
OAGCCXjGTATGA 

Seq ID NO: 42 Protein sequence: 

Protein Accession none found, Eos cloned sequence 

1 11 21 31 41 51 
1 I I I I I 

MTQNKLKLCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRISWIISIC VFVLTFS APL ATVLSNRFGH RL WMLGGLL VSTGM V AASF SQEVSHMYVA 1 20 
IGUSGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIEPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRKJY 360 
lEUCVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTHIPLLAED DVVGIEKMSS 420 
AAGVYIFIQS lAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: I -1 749 (underlined sequences correspond to start and stop codons) 

i 11 21 31 41 51 
I I I I I I 

ATGC rrGT(rrG G Ci lCri GAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACrrCCCGGA 60 
GG AAAGAAAC TTCCGTGGGA GGCTTCX:ATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAGA GGGAGAGCCG GCCGGAGGCn* GCCGGGCTCC TGTGGGACCG CGCTGCAGCC 180 
GGGGAGGCGG AGAAGGGGAA CCGGGGCGAG CCXjCCCGCCT GGATCCGCX3C CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTTGGGGGCGC GCAGGACOCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGACKj GTCXGCiTTGC CAGTGAAACC TCCAGAGGCT 360 
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TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420 
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACX3GGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCX3GCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
D CTTCTCGGAG ACCGTCCTGC GCTCTCTGGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT CATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGG CG AGTA CTGCCACGGC 780 
TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
1 U GCGCGCCTGG ACCAGGGCGG CTGCG ACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CCCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCXCGAAAG GAAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
1 D ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 

CTGGTGGCAG CCTGTTGCTG CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGGGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1 500 
ZU AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TG AACTGTCA GCAGGCCACC 1 560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCOCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1 740 
ACTGTATAAC CG AG AGTCAC TGGTGGGTTC CTTTACTGAA GGGAGACGAA GGCAGGGGTG 1 800 
ZD G ATTCTCG AG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGG ATG 1 860 
GCTTCATTTG CXTCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTG ATT TATGATGGAA AACCGGCCTC AGCTGGAGAT GACTGTG ATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCmT 2100 
^ TTGTT GTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CnTTACTGG G ACTTTTTTT 2 1 60 
I m I TTTTT mTTTTTAA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGGAT TCAGGCAACA CTCCTGCCTC 2280 
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCXXCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
J D TCTGCCTGTC T CAGCCT CCC AAAGTGCTGG G ATTACAGGC GTG AGCCXCC GCCCCCAGCC 2460 
TGAGeCTTTT TTiiiiiCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TG AACTCTGT GAGATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATTTCATTG TAAAAAG ATA TATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAG AG ACTG AAATA A ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAGAACTC 2760 
TGGGATG ATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACXTATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880 
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG ^940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTG CAAG T AATTACTATA CTTGTAAATG AAGATCAGTA TTTCTGCXTTA GATCTG ATAA 3060 
AAAAATTTTC TTGTCTTAGT T ATAA A AATT CAAAG A AATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCA TAAC CT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180 
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
G ATTGTTAAG AG AA A AGCTT TTCA ACG AAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
3 U ATTTCCTCTC TCTTTCAGGC CTCAAC AGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAAAAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
K < TGTTTC ATGA CTG AATAATG TAAA ACCAGT GTTGGCAATT GGTATCATCA ATQATACTCA 3600 
D D TTTTTTAATA ACCAAAGGCA GGGG AA A ATC ATTTTACTTA TTAATAAATA TTTTATG ATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



60 



Seq ID NO: 44 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 H 21 31 41 51 
1 i i i I 1 

MLSGFLMSPS TQHRAQYTPG GiGCLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
- GEAEKGNRGEPPAWIIO^QQQPRPPPAGQAPGTAAGGAQDPRLREGRSRGRVRLPVKPPEA HO 

OD SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLG VAAEGSG 1 80 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP lYVPFLIVGS VFVAFIILGS 420 
/ U LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPS ASTS RGSSSRQSST AASSSSSANS 480 

GARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPPYVGYTVO 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKMYPAV TV 

Seq ID NO: 45 DNA sequence 
75 Nucleic Acid Accession #: NM_002285 

Coding sequence: 55-3738 (undeilined sequences conespond to start and stop codons) 
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I n 21 31 41 51 
I I I i I i 

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCG ACGC TGACACCTCC CACCATGGAC 60 
AGCTTCG ACT TAGCCCTGCT CCAGGA ATGG G ACCTCGAGT CACTGTGTGT CTATGAACCA 120 
5 GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
GATG AACTCT CCAACCGGAT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAG ACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGG AGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAGAGCCCA G AACCAGCCC 420 

1 0 . TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGAGACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 

1 5 AAACCCAGCC TGGTCCAGCA G AAACCG ACC GCGTATGTGA GGCCAATGGA CGGCCA AG AT 780 

CAGGCCCCTG ATGAGTCTCC TAAGCTGA AG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 . 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 

• GAAGAAATAA TCCGGG AGAT G ACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 

20 AAAGTGG AAC CAACCAAATT TCC ATTTCCA A ATA AGG ACT CTCAGCTTGT ATCCTCTGG A 1 080 

CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 11 40 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GC AGC AGCAG CGGCACG AGC 1 320 

25 AGCTCCTCCA GCGACTCAG A G AGCAGCTCC GG ATCTG ACT CGG AGACCGA GAGCAGCTCC 1 380 

AGCGAGAGTO AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA. ATGAAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 
G ACGTCCAGG ACTGTGGG AA AGTCCCCG AC GTTTGCCAGC CCAGCCTG AG AG AG A AGG AG 1 620 

30 ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAG AAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTGAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 

35 GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 

TCCTCCGTG A CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTICA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCtCCTCC 2160 
GGGAATGATC AGAGGCTGAA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 

40 GTAGGCTCCA TCAACGCCAG GACCACCAGT GACATCGCCA AGGAGCTGGA GGAGCAGTTC 2280 

TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGGAGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAG AG CGCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 

45 TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGG AGAOAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCAAT AA ATA AAA AT GAAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCGAGGACTTAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 

50 . CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AGAAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
-. GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3 120 

5 5 ATGG AATCCA AATCTCCTTA TTACCTG ATG TATTC AG AAA CAGTAG AGCT CATCAGGTAT 3 1 80 
GCTATGAGAC TAAAAACCCA CTCAGGCCCC AATGCCACAC CAG AAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGGA TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 

60 CCCAACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CGACCATCGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CAC AGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGG AAC 3720 

65 AGCGCCCACC TGTCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

SeqIDNO: 46 Protein sequence: 
70 Protein Accession #: NP_002276 

1 11 21 31 41 51 
I I I I 1 I 

MDSFDLALLQ EWDLESLCVY EPDRN ALRRK ERERRNQETQ QDDGTFNSS Y SLFSEPYKTN 60 
75 KGDELSNRIQ NTLGNYDEMK DFLTDRTNQS HLVGVPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKRGTMG WQK AGHPPSDGQQ RATQQGSLRT LLGDG VGRQQ 1 80 
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
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QDQAPDESPK LKSSSETSVH CTSYRGVPAS KPEPARAKAK LSKFSIPKQG EESRSGETNS 300 
CVEEIIREMT WLPPLSAIQA PGKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQAAQRTALR ALSDSAWQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
5 PILIQNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEDCSTCKEE QRPRTANKAP 540 

GSKGVKQKSPPAAVAVAVSAAAPPPAVPCAPAENAPAPARRSAGKKPTRRTERTSAGDGA «K) 
NCHRPEEPAA ADALGTSVWPPEPTICrRPCGNNRASHRKELRSSVTCEKRRTRGLSRJVP 660 
ICSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780 

1 0 HLPQEPG VLS APATKDSESA PFSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQGEICD 840 
SSSRLATSTS NTLSANHCNM NINS VAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALNYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 

1 5 RDHAVKYSKA LIDYFKNSSIC AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVSIPQRmQMAA>nWSITNSILHSYDYWEMADhnJVKENREFFNDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

SeqIDNO: 47 DNA sequence 
20 Nucleic Acid Accession #: NM J)33 1 5 1 

Coding sequence: 35 1 -4499(underiined sequences correspond to start and stop codons) 

I 11 21 31 41 51 

25 I I I ) I I 

ACTGGG ATAA AGCAAG AAGA CTGATTTTAT GAGCAGGGGT TTGATACATC AAAGGAGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTGAT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 180 
TAAGG AG AGG AAAG AGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 

30 GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CmrCATAT TCTCTGATTC TGGGAAACGA AGAATTGGCA GGAACTGAAA ATGA CTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGG A CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 . 

3 5 ATG ATGCTGC CTTG AG AACC ATG ATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAG A CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GG AAGAAG A A GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 

40 TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 

TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCCCmTT'CTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCCTCC TTTGCCTTTG 1080 
AGAAGCTCAT CX;AATTTAAG TCTGTAATAC ACATCACCTC AGGAGAGGCC ATCAGCTTCT 1 140 

45 TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TGGACCCCTA GTACTGATCA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TCAGCATCAC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 

_ AAGTTCrCACTTGCATTAAGCTGAtTAAAATGTACACATGGGAGAAACCATTTGCAAAAA 1440 

50 TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCX:aGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TGAATCTCCT TCGGCTGTCA GTGTTCrrTTG TGCCTATTGC AGTCAAAGGT CTCACG AATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAGAAGT TTTTCCTCCA GGAGAGCCCT GTTTTCTATG 1740 

5 5 TCCAG ACATT ACAAG ACCCC AGCAAAGCTC TGGTCnTGA GG AGGCCACC TTGTCATGGC 1 800 
AACAG ACCTG TCCCGGG ATC GTCAATGGGG CACTGGAGCT GGAG AGGAAC GGGCATGCTT 1 860 
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TOGGGCTAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1 980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGGAG ATGCACTTGC 2040 

60 TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGGAG ACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC G ACCX5TCAG A TCTACCTGCT GGACG ACCCC CTGTCTGCTG 2340 

65 TGGACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT T AAGA AGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACAAGGAAGC CACTTCGG AC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 

70 AAGAGTCTCT CAACGGA AAT GCTGTGCCGG AGCATCAGCT CACACAGGAG GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTAACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGG AGCAGG GCTCGGGG AC CAATAGCAGC CGAGAGAGCA 2880 
ATGG AACCAT GGCAG ACCTG GGC AACATTG CAG ACAATCC TCAACTGTCC TTCTACCAGC 2940 

75 TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTGGG GGTCTGCTCC TCAGGQATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAA GCTCT TC AACAAGGTTT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCmTG AACTGCTTCG 3120 
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CAGGGG ACTT GG AACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTGCTGGTCC 31 80 
TGTCCTTAAT GGTGATCGCC GTCCTGTTG A TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAATGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAG AGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
5 ATTCTCTGCA AGGCCTGAGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAG AG GCTG ACTG AT GCGCAGAATA ACTACCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTGAGGCTG GAGATCATGA CCAACCTTGT GACCTTGGCT GTTGCXXrTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCf GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 

1 0 AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATGAAG AT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT GCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCCTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGG ATTCTC ATTG ACGGCG 3960 

1 5 TGG ACATTTG CAGCATCGGC CTGG AGGACT TGCGGTCCAA GCTCTCAGTG ATCCCTCA AG 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACCTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA G ATCATCCTT ATCGATGAAG 4260 

20 CCACAGCCTC CATTGACATG GAGACAGACA CCCTG ATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGGTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGATAAG 4500 

^ GAGATGTGGA GACTTCATGG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 

25 TCGAGGCCCA CAGTCTGCGA CCTTCTTGTr TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAG ACC TTAG AACCCC AGAACCATCT AAGACATGGG ATTCAGTG AT CATGTGGTTC 4740 
TCCTTTTAAC TTACATGCTG AATAATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTG AT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT G ACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 

30 TC 

Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_1 49 1 63 .2 

35 MTRKRTYWVPNSSGGLV>mGIDIGDDMVSGLIYKTYTLQDGPWSQQERNPEAPGRAAVPPWGKYX)AALRT 
MIPFRPKPRPPAPQPLDNAGLFSYLTVSWLTPUvIIQSLRSRLDENTIPPLSVHDASDKhP/QRLHRLWEEE 
VSRRGIEKASVLLVMLRFQRTRLIFDALLGICFCIASVLGPILUPKJLEYSEEQLGNWHGVGLCFALF 
LSECVKSLSFSSSWnNQRTAIRFRAAVSSFAFEKLIQFKSVIHITSGEAISFFTGDVNYLFEGVCYGPL 
\a-ITCASLVICSISSYFIIGYTAFIAILCYLLVFPLAVFMTRMAVKAQHHTSEVSDQRIRVTSEVLTCIK 

40 LIKMYTWEKPFAKnEDLRRKERKLLEKCGLVQSLTSITLFIIPTVATAVWVUHTSLKLKLTASMAFSM 
LASLNLLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVFYVQTLQDPSKALVFEEATLSWQQTCPGI 
VNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELHKJ^a.VVSKGMMLGVCGNTGSGKSSLLSAILEE 
MHIXEGSVGVQGSLAYWQQAWIVSGNIRENILMGGAYDKARYLQVLHCCSLNRDLELLPFGDMTEIGER 
GLNl^GGQKQWSLARAWSDRQr^LLDDPLSAVDAHVGKHIFEECIKKTLRGKTVVLVTHQLQYLEFCG 

45 QIILLENGKICENGTHSELMQKKGKYAQUQKMHKEATSDMLQDTAKIAEKPKVESQALATSLEESLNGN 
AWEHQLTQEEEMEEGSLSWR\miHYIQAAGGYMVSCIIFFFVVLIVFLTIFSFWWLSYWLEQGSGTNSS 
RESNGTMADLGNIAD^fPQLSFYQLWGLNALLUCVGVCSSGIFTKVTRKJ^STALH^^CU^^^KWRCPM 
FDTIPIGRLLNCFAGDLEQLDQLLPIFSEQFLVLSLMVIAVLLIVSVLSPYILLMGAnMVICFIYYMMF 
KXA^GVFKRLE^^V'SRSPLFSHILNSU^GLSSIHVYGKTEDFISQFKRLTDAQN^m-LLFLSSTR^^ 

5 0 EIMTNLVTU^ VALFVAFGISSTPYSFKVM A V^^ VLQLASSFQATARIGLETEAQFTA VERILQYMKMCVS 
EAPLHMEGTSCPQGWPQHGEIIFQDYHMKYRDhmTVLHGINLTIRGHEWGrVGRTGSGICSSUJMAL^^ 
LVEPMAGRILIIXjVDICSIGLEDLRSia^VIPQDPVLL^GTIRFNUiPFDRHTDQQIWDAl^RTFLTXAI 
SKFPKlO-HTDVVENGGNFSVGERQLLCIARAVLRNSKIILmEATASroMETDTLIQRTIREAFQGCTVL 
VIAHRVTTW^CDfflLVMGNGKWEEDRPEVUlKKPGSLFAAU^ATA 

Scq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NM_033419 

Coding sequence: 1 S-980 (underlined sequences conespond to start and stop codons) 

* 60 1 11 21 31 41 51 

I I i I ' > ^ 
CGAGCCAGGG AGAAAG GATG GCCGGCCTTGG CGGCGCGGTT GGTCCTGCTA GCTGGGGCAG 60 

CGGCGCTGGC GAGCGGCTCC CAGGGCOACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 

^ AGTGCG AAG A GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 1 80 

65 TCTACATGAG TCTAGCAGGC TGGACCTGTC GGG ACG ACTG TAAGTATGAG TGTATGTGGG 240 

TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 

• CCTTCTCCCG GTTCCTGTTC TTTCAAGAGC CGGCATCGGC CGTGGCCTCG TTTCTCAATG 360 

GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 

TGTACCACAC CTGTGTGGCC TTCGCCTGGG TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 

70 TTTrCCACACCAGGGACACTGACCTCACAG AGAAAATGGACTACTTCTGTGCCTCCACTG 540 

TCATCCtACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 

TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 

GCCTCATCCG CTTCG ACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 

ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GG AACCAGCG GCGGCTGCCT CACGTGCGCA 780 

75 AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 

CACCGCTCTT CTGGG TCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 

ACGTCCTCTT TTTCAGCTTT CTGG AAGATG ACAGCCTGTA CCTGCTG AAG G AATCAGAGG 960 
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ACAAGTTCAA GCTGG ACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020 
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATQ ATTTTCTCTr TtCAACTTCT 1080 
TG AACTTGGA CATG AAGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCX:a GCATCTGGGA CTCGAGAGTG 1200 
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCTCCCA CCCTCCTCCA GGACACCACT AGGTGGTGCT 1440 
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560 
GG AGGCCTCA CCATACTCCC TTTCAGGGCC AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTGTGTCGG GAGTGTGGGC CAGGCTGAGT 1680 
. GCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGTGAGCTCAGGCCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1 800 
1 0 GGTGGGTGTG TTAGCGTGGG TTAGGGG AAC GTGTGTGCGC GTGCTGGTGG GCATGTG AG A 1 860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCXrr 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCnT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
ZU TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TG GCTG GCCT TCCCCTCCAA AACCTCCATT CCXTCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG G AGAAAGCTT 2280 
ATGGCTGGGT CTGGTTTCtT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGGTA AAGGT GACCCCTGCC ATTTACCAGC 2400 
ZD AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAGA CTGGATTTTT GCTCTGCCCC TG ACCCCTTG 25^0 
TCCCTCTTTG AGGG AGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580 
CGCTAGCTTC TTTTG ATACT GA AAACTTTT AAGGTGGG AG GGTGGCAAGG GATGTGCTTA 2640 
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA 
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Seq ID NO: 50 Protein sequence: 
Protein Accession #: NP 219487.] 



1 11 21 31 41 . 51 

i i I I I i 
MKDVGPESCG QPTPCVVPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 1 80 
40 LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

Seq ID NO: 51 DNA sequence 
Nucleic Acid Accession #: XM_059098. 1 

45 Coding sequence: 1 78-61 8(underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

i i i I ) I 

G ATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120 

DU GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGGTG AGATG 1 80 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240 
TCCAGAGATC TCAGCAAGCC ACTTACXriTC CCAGATGGAT GCACATTGCC TGCAGGGATC 300 
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTG AGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420 

DD TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480 
AAG GTAAC CA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAG AATG GG ATGTATTT GCACCTGAAG 600 
AAACrCTCTG AATGTTAGATCTCAGGGTACAATGATTAAACGTACTTTGTTTTTCGAAGT 660 
TAA ATTTACA GCTA ATG ATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGGATTG 720 

OU G AGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780 
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGG AGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTnTAA 900 
AATAGTTTTC AGAATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960 
ACACTAGAAAATCATGTAGAATAAAAATTTTAAATCrCACTTCACTTAGCCGACATTCCA 1020 
TGCCCT^ACCAATCCTACTGCTTTTCCTAAAAACAGAA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1 140 
ATTCCTATAT TGTTATTG AT 11111 i CACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequencer 
70 Protein Accession #: XP_059098.1 

I n 21 31 41 51 

1 I I 1 I I 

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITWLSIWGL HHNPAVWKNP 60 
/ D KVFDPLRFSQ ENSDQRHP YA YLPFSAGSRN CIGQEFAMIE UC VTIALILL HFRVTPDPTR 1 20 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
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Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM J)309 1 6 

Coding sequence: 1-1533 (underiined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGC CCCTGT CCCTGGGAGC CG AG ATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGACCTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 1 80 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCXjG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660 
CTG ACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGG ATCAC CCACATCCTC 720 
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAG ACCAAAA TCTGTGGCAC 780 
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCT CATAC 840 
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGC ATCTACG TCTGCCATGT CAGCAATG AG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATCGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1 140 
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200 
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCtGACCACG GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

SjcqlDNO: 54 Protein sequence: 
Protein Accession #: NP_1 1 2 1 78. 1 

1 11 21 31 41 51 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TVVLGQDAKL PCFYRGDSGE 60 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 1 80 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240 
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 360 
FCLLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGDCQ 480 
AMNHFVQENG TLRAKPTGNG lYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007 1 70. 1 

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60 
CTGGACCAGT GC^GACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGG A AAGC ATGTACCACT CACTG ACATA TGCCACCATC 1 80 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGG AGG CACAGATGCT GTGTCAGAGG CACCGG AGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTG AAG AAG A AATCCACGCT 360 
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480 
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGG AGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCC TACT 600 
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGG AA ACAAGG ACTA TGGC. ' GCTG 660 
CAGCTGGAGG AGGGAGCGTCAGGGCACAGGTTCCGCTCTGTGCTCTGTGTCATC CTG 720 
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCC; ;;AG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCC MTC 840 
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTriCG AG 900 
G AGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020 
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140 
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGG AAGC AGCCGAAACT CACGGATGGG 1320 
ATAtTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
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TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440 
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1 560 
_ G ACAG AAACG AAG AQGCCAT CAAACTTTTG G AATCTGCCA AGCAAAACTA CAAG A ATT AC 1 620 
5 TCCATGGAGTCAAGGACACACTTTCGAATCCAGGCAGCCACACrCCAAGCCAAGTCTTCC 1680 
CTAG AGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TGTAGCTTTG TGCAGCAGTT 1 740 
CCGGGCTGGA AGACAG AGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1 800 
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 
_ TGTATCX:GTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 

1 0 GGCAG AGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980 
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACAITTTCCT TAGAAATACA TTG ATGGGAT 2040 
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCACCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 

1 5 AGAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAG ACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
A AGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG tag AAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 

20 CTG AAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580 
CATTTGCTTA CTGACAGCAT n 1 1'G ri AAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640 
AA 

Seq ID NO: 56 Protein setniehce: 
25 Protein Accession #: AAC39582.1 

1 11 21 31 41 51 
I- I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
3 0 QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVC Y AECLLQRA AL TFLQDENM VS 1 20 
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 1 80 
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVDCGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300 
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
35 KLKIAGKSLPTEKFAIRKSRRYFSSNPISLPVPALEMMYI WNGYAVIGKQPKLTDGILEI 420 

ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540 
SRSMVSSVSL 

40 Scq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_006670. 1 

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 
^_ 1 11 21 31 41 51 

45 ( I ! I 1 1 

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC.CCACCTCCTC GGCATCCTCC 120 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGG AC 1 80 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 

5 0 AATCTG ACCG AGGTGCCCAC GG ACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

__ AATGGCACCCTGGCTGAGTTGCAAGGTCTACCCCACATTAGGGTTTTCCTGGACAACAAT 540 

55 CCCTGGGTCTGCGACTGCCACATGGCAGACATGGTGACCTGQCTCAAGGAAACAGAGGTA 600 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 
TATTTGAACC GCA AGGGG at AAAAAAGTGG ATGCATAACA TCAG AG ATGC CTGCAGGGAT 840 

60 CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900 
AGTTCTAACT CGOATGTCCT CGAGTGA 

Scq ID NO: 58 Protein sequence 
Protein Accession #: NP_006661.1 



1 11 21 31 41 51 

I I ! I i I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
70 QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEONALKVLH NGTLAELQGL PHIRVFLDNN 1 80 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 
SYWLGIVLAUGAIFLLVLYLNRKGIKKWMHNIIU>ACRDHMEGYHYRYEINADPRLTT«. 300 
SSNSDVLE 

75 

Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession <^: NM_024022 
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Coding sequence: l-1362(underlined sequences conespond to stait and stop codons) 

1 11 21 31 41 51 

5 ATGGGGG AAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC GCTTTTTGGC 60 
CTTGATGATT TGAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGA A GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 1 80 
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCG ACTGCT CAGGGAAGTA CAGATGTCGC 240 
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300 

1 0 GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAG A ATGCCGTGCT CCAGGTGTTC 360 
ACAGCTGCTT CGTGGAAGAC CATGTGCTCC G ATGACTGGA AGGGTCACTA CGCAAATGTT 420 
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTG AGTTCAG ATAACCTCAG AGTGAGCTCG 480 
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540 
GTGACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACXjTGGTT 600 

1 5 ACCTTGCAGT GCACAGCCTG TGGTCATAG A AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720 
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780 
G ACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 

20 CTGGGCAATG ACATCGCXICT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960 
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1 140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200 

25 GGGGGGCCCC TGGTGTGTCA AGAGAGGAGG CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260 
GGC ATCGGCT GCGCAGAGGT G AACAAGCCT GGGGTGTACA CXTCGTGTCAC CTCCTTCCTG 1 320 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAAC CT GA 

Seq ID NO: 60 Protein sequence 
30 Protein Accession #: NPJD76927 

I 11 21 31 41 51 

I I I 1 1 I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP HVIGnALI 60 
3 5 LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDG VSDCFCD GEDEYRC VRV GGQNAVLQVF 1 20 

TAASWKTMCS DDWKGHYA^rV ACAQLGFPSY VSSDNLRVSS LEGQFREEF V SIDHLLPDDK 1 80 
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SG WG ATEDGG DASPVLNHAA 360 
40 VPLISNKICN HRDVYGCnS PSMLCAGYLT GGVDSCQGDS GGPLVOQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

. Seq ID NO: 61 DNA sequence 
Nucleic Acid Accession #: NM_006475 
45 Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I ! I I i 

AACAGAACTG CAACGGAGAG ACTCAAGAIG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60 
TTGCTGCTTA TTGTTAACCC T ATAAACGCC AACAATCATT ATGACAAG AT CTT GGCT CAT 120 
AGTCGTATCA GGGGTCGGG A CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 1 80 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGG AGCCACC 360 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAGAATGT TGACCAAGG A CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600 
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAQ GTGTCCTAGA AAGGTTCATG 900 
GG AGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960 
TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1 140 
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 
GCATTTTCTG ATG ATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAG AAT 1 320 
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCl 1 1 1 AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 
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TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 
AAAATCTTTC TG A AAG AAGT AAATGATACA CTTCTGGTG A ATG AATTG AA ATCAA AAGAA 1 860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG 1TGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
D CAAATTAAGT TTGTTCGTGG TAGCACCTTC A AAG AAATCC CCGTGACTGT CTATACAACT 2040 
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 
1 n AAATACACCA AAATCATTGA TGGAGTGCCT GTGG AAATAA CTGAAAAAGA GACACGAGAA 2280 
1 U GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 
. GAAGGTGGTGATGGTCATTTATTTGAAGATGAAGAAATTAAAAGACTGCTTCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAG TGA AA ATCCAAAAAC CAG AAAAAAA TGTTTATACA ACCCTAAGTC 2580 
1 0 AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640 

ACAAAGAAGC AATCATCAAA TAATTCTGA A CACAAATTTA ATATTTmT TTCTGAATGA 2700 
GAAACATGAG G GAAAT TGTG GAGTTAGCCT CCTGTGGTAA AGG AATTG AAG AAA ATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTG ACATTA A AAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AG AAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 
ZU CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
JATCTCTCCATGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000 
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTITA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
^ TCTC AAACGT TTCA ATA AAA CCATTTTTCA G ATATAA AG A G AATTACTTC AAATTG AGTA 3 1 80 

Z D ATTC AG AAA A ACTC AAG ATT T A AGTTAAAA AGTGGTTTGG ACTTGGGAA 

Scq ID NO: 62 Protein sequence: 
Protein Accession U: NP_006466 

30 1 II 21 31 41 51 
! I I ) I i 

MIPFLPMFSL LLLLI VNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 
KNWYKKSICG QKTTVLYECC PG YMRMEGMK GCPA VLPIDH VYGTLGIVGA TTTQRYSD AS 1 20 
KJ-REEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNXW^E LLNALHSHMI hn<ilMLTKDLK 180 

J O NGMIIPSMVN NLGLFINHYP NG V VTVNCAR IIHGNQIATN G WHVIDRVL TQIGTSIQDF 240 
lEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLICLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCIVIEKGSKQG 480 

4U RNGAIfflFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD UCELLTQPGD WTLFVPTNDA 540 
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600 
TLLVNELKSK ESDIMTTNG V IHWDKLLYP ADTPVGNDQL LEILNKLDCY IQIKFVRGST 660 
FKEIPVTVYT TKIITKVVEP KflCVIEGSLQ PIIKTEGPTL TKVfCIEGEPE FRLIKEGETI 720 
TEVIHGEPIIKKYTKIlDGVPVEITEKETREERnTGPEIKYTRISTGGGETEETLKKLL 780 

4D QEEVTKVTKF IEGGDGHLFE DEEKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 
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Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession NM_02b974 

Coding sequence: 81-3080 (underiined sequences comespond to start and stop codons) 



1 11 21 31 41 51 

I i I I i I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGG GGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTG GG 1 20 

D O CGGTGCTGCT GCTGCTGCTG CTGCtGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 1 80 

CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGG AA GGCAGGCAGT GTGAGGACAT CGATGAATGT GG AAATG AGC 360 

^ TCAATGG AGG CTGTGTCCAT G ACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420 

OU TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGQAC GAGTGCCTGG 480 
AGAACAATGG C GGCTG CCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660 
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 

OD TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAG AGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900 
ATAAACGGGT GAA ACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 

/ U TCCAGTTGG A TGGG AAG ACA TGTAAAG ATA TTG ATGAGTG CCAGACCCGC AATGGAGGTT 1 080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTG ACTG CGGCTGCAAG AAAGGATTTA 1 140 
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 

/D GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAGAT 1500 
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GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGG ACAAGT GTAACCTTTA 1 560 
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620 
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
5 TTATCACTGTTGAGTTTGAGCTTGAAACTAACCAAAAGGAGGTGACAGCTTCTTGTGACC 1800 
TO AGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCCGC AOGCTCAG AA 1 860 
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 

1 0 GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTG AAACC AG AGTTCAAT 2340 

1 5 GTTCACCTGG ACATTTCTAC AACACCACCA CTCACCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGG A AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACG ACTG 2460 
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAG ATTTCAC TGGGTACATT G AATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG G ACCATCAAC CCACCCCCCA AGCGCXGCAT CCTGATCGTG GTCCCTGAGA 2640 

20 TCTTCCTGCC CATAG AGGAC G ACTGTGGGG ACTATCTGGT G ATGCGG AAA ACCTCTTCAT 2700 
CCAATTCTGTGACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760 
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGG AACAGC GCTAGAGGGT 2820 
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880 
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAG AAACTTA 2940 

25 TCAAGGCTCTGTTTGATGTCCTGGCCCATCCCCAGAACTATTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAA TGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3 180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAG ATATT TTGGTAAATT 3240 

30 G AACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAG ACTG AG AATGGCTT TG AGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
, CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 

35 CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600 
ACTCAGTTTC TCCACAGCCT T€TCCAGCCT GTGTG ATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

40 Seq'lD NO: 64 Protein seouence: 

Protein Accession #: NP_066025.1 

1 II 21 31 41 51 

45 MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENNGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180 
DHGCSmCKE APRGSVACEC RPGFELAKNQ RDCILTCKHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDICRVKRR LLMETCAVNN GGCDRTCKDT 300 

5 0 STG VHCSCPV GFTLQLDGKT CKJDIDECQTR NGGCDHFCKN I VGSFDCGCK KGFKJLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SUCNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 

55 FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRFGNS GALKTPEAWN MSECGGLCQP GEYS ADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGNTTTDFDGSTNITQCKNRRCXjGELGDFTGYIESPKYPG>r^ AOT 840 
PPPKRRILTV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 

60 IQFKSNEGNS ARGFQVPYVT YDEDYQEUE DIVRDGRLYA SENHQHLKD KKLIKALFDV 960 
LAHPQNYFKY TAQESREMFP RSFIRIXRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 
Nucleic Acid Accession #: NM_007210 
65 Coding sequence: I-l 869 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

ATGA GGCTCC TCCGCAG ACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60 
70 GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAG AAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGG AAG G ATCACGTCC TGG ACCTCAT GCTGG AGGCC 1 80 
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAG A AGCCCAGCAG 240 
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
75 CAGAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480 
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
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GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600 
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGG AGCACC TA AAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGCGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
5 CAGGCGGAGG TGCTCACGTT CCTGG ATGCC CACTGTGAGT GCTTCCACXjG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAGAAGCAG 1020 
AGGCGCAAGG ATG AAACATA CCCCATCAAA TCCX:CGACGT TTGCTGGTGG CCTCTTCTCC 1080 
lU ATCCCCAAGTCCTACTTTGAGCACATCGGTACCTATGATAATCAGATGGAGATCTGGGGA 1140 
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGG ATGGA CAGCTACAAG 1 320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 

i D GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT 1TCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTG A CGCCCACXHT CTATGGTGCC 1 500 
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CA AGGGTGCT 1 680 

2U CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCCCCATCA GTTGTGGCTC 1860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920 
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCX3TAAAQA TGGATTTCTA 1980 

ZD AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
T GCTG AGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTQ AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTGG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280 

3 U AACATAGTG A GG ATCCCATC TCTACGCCCA CCCTCCCCCC GGCAA AAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400 
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520 
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580 

J 3 CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG G AGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCXZTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CrTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AACCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
. _ ATTGCACCCC AGCCTGAGCA ACAAG AGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
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Seq ID NO: 66 Protein sequence: 
Protein Accession #: NP_009 1 4 1 



1 n 21 31 41 51 
11)111 

MRLLRRRHMP LRLAM VGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60 
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 1 20 
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 1 80 
5 U VU VFHNEAW STLLRT VYS V LHTTPAILLK EELVDDAST EEHLKEKXEQ YVKQLQ WRV 240 
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLARIAEDK TVWSPDIVT 300 
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEn PCS WGHVFR TKSPHTFPKG 420 
TSVIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWY 480 
LH^^vnrPEMFVPDLTPTFYGAUCNlX5TNQCLDVGE^n^GGKPLIMYSCHGLGGNQYFEYTT 540 
QRDLRHNIAK QLCLHVSKG A LGLGSCHFTG KNSQVPKDEE WELAQDQUR NSGSGTCLTS 600 
QDiOCPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence 
OO Nucleic Acid Accession #: NM_014n2 

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
\ ] \ \ \ .\ 

03 TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 

GCGGAG ACTG AGGTCCTGAC AAGCG ATAAC ATTTCTGATA AAG ACCCG AT CTTACTGCAA 1 20 
TCTCTAGCGT CCTCnTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCG ATTG 1 80 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTC CCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 

70 TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360 
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAGACGGT TCATGGCTrr 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC GAGGGTGTTC TTGACGATTA ATCAACAGAT 480 
GTACAG ATCA GCTCTCAAAA TGTCTTCTGT GT CTTCTGAG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCTGCTAGTrGACTAATAGAATTAATAATTGTAAAAAGCACTCTAAAGCCACA 600 

'5 TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660 
CTCTGAG AAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAG AAA 720 
GCAAGGTATC TGGAAAG AAC AAAGAATTCT CTGCAGATCA GATGTCAGAA AATACGGATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGGAACATAG CTTGCATGTT CAAGATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGG AGT CTGTG AGCCC TTG AAGTCTC 1020 
5 CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1 140 
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AG ATGGGGGT GTAAGACTG A ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320 
1 0 GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGC CCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAG AA GGTCAACCGT TCTGTGTTTT 1 500 
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTA AATGGG ACCTATG ATG 1 560 
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 
15 GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740 
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800 
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAG AT AACAGTCAAA GCAGGAGATG 1 860 
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAG A CAAAATGGTA 1 920 
20 CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT C AGCTGTG AG TC ATCT AGCT 1 980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGCCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGG A GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 
2 5 GTG ACTTCCG ATATTCCAA A AGCCATGGCC CTG ATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGG ACT TTGCAGCCCA G AAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400 
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAG ATG 2520 
30 TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGG ATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700 
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGG ACAT CACTACAGCC AACGGCGAAG 2820 
35 AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880 
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060 
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
40 GGG ATAGTCC CAATGTGG AG GCCGCCCATC TGGCGCG ACC TATTTATGGC TTGGCTGTGG 3 1 80 
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTG AC CACAAAGACC TCTCTCTGGC 3360 
G A AAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 
45 CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600 
TAACTGAAAG TCACCAGAGA G AAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660 
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
50 TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAG AGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTG CCTTTGTACA TAATG ACTTC CAG AGTG AAG 3960 
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 
. 5 5 TG AGTC ACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTrCA 4080 
ATCTGCCTCC TCATTTTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAG ACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AA AAAATGAA GGTCCCTTG A ATGTAGTAAA AACAG AGAAA GTTG ATAG AA 4260 
- GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGG ATGAAG 4320 
60 TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380 
GCCAGCATCT TTGCACGGAC AAATATG ACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AAC CTAAA GA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTT ATGAA 4560 
GACCTATTAA AAAAATACTT CATAGAGCCT GCCTTATCCA ACATGAAATT CCCTTCTTTT 4620 
65 GTTATTCTTT CTTTTGATGA GTAGGTTACC AAG ATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTI A AGTCA AAACCATCTT 4740 
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAG ACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAG ACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860 
AAGCAGTACr GG ACTGGCCA TTGG ACCATT TGTTCCA AAA CCCATAAATT GTTGCCTAAA 4920 
70 TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980 
AGAAAAATGG CGCCCTCTCA ATTTAGTCTTCTCTCATTGG CCATGTTTCA GATTTTGACC 5040 
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100 
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTG ATATTTT 5 1 60 
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATTTTGT 5220 
75 CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280 
AGCCAGGTAT TTGTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
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ATAGATGG AT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460 

CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520 

GTGGTCAGGCAGCGTCTAAGGTTACGTTACCACAGACTGACAGTTGGTATATGTACCAGC 5580 

CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
5 GTATGTCCTC ATAG AACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 

GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 

TAAAGGTGCTGTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820 

TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTOCCAC GGAGATGTAG 5880 

AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940 
1 0 AG AACAAAT A GG AAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT T ACATTTCAA 6000 

ATAATGGTGC CTG ACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 

TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATrT GCAAATGTAG 6120 

CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAG AT ATCCI ITTGT GTGTGTGTGT 6180 

GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
1 5 GAAAACAAGA AGG AAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 

ATTCATTCCA AGTACl'l i I T I I l lT"i r AAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 

TAAGATG ATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420 

CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACQC TGATTTTTTT TTGCATTCTG 6480 

TAGTCGCATT TGCACTCCAT TTTTAC ATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 
20 TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600 

AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 

TTGTGTTGTC ATTTTCATTT TTGCATTTTT TTGTTTGCAT ATTAAA AAAA GAGAAAAGAG 6720 

AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 

TATTCTG ATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAG AAGTGG 6840 
25 AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900 

GGATGCATCT CAAAATGTAT AGCCAGACTT G AG AGGTGAC AATTAAAGAT CTAAAAAAGA 6960 

G AGGAG ATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 

TCGTGGCAAT CCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
^ AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAG 7140 
3.0 CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 

ATTTTCATTC AAT TI l"! ITl TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260 

AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320 

GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380 

CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC TTTTTTGAAA 7440 
3 5 G AG ACAAAA A TTTATTATAA CACTAAACAC TCCTTTTTTG ACATATTAAA GCCTTTATTC 7500 

CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560 

TCTTAAATTG TGATTTTAA A CG AGCTATTA TGGTACX5GAA CTTTTTTTAA TGAGGAATTT 7620 

CATGATG ATT TAGGAATTTT CTCTCTTGGA AAAGGCTTCC CCTGTGATGA AAATGATGTG 7680 

CCAGCTAAAA TTGTGTGCCA TTTAAAAACT G AAAATATTT TAAAATTATT TGTCTATATT 7740 
40 CTAAATTGAGCTrrGGATCA AACTTTAGGCCAGGACCAGCTCATGCGTTCTCATTCTTCC 7800 

TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAG AAA 7860 

AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGG ATTG AGAGAGACAC TACATGACTC 7920 

CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAQGAAAAGA 7980 

GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040 
45 TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8100 

GGG ATTCTTT TCTAAGGTAA TATTAATG AG AAGGG AAAA A AGTATCTTTA ACAGCTCTTT 8 1 60 

GTTGAAGCCTGTGGTAGCAC ATTATGTTTATAATTGCACATGTGCACATA ATCTATTATG 8220 

ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 

AAATACATTT TTCTTAATAA ACTGAAG AGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340 
5 0 CTCCTGTTGT GTGGCTGCA A A ACATCACAA AGTG ACCGGT CTTG AQACCT GTG AACTGCT 8400 

GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CAT CCAG TGG 8460 

TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520 

TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580 

AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640 
55 TGTCCAAGGC ACAAGAGAAA ATTG AGG AAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700 

TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA GCAGGCAACT GCAAATG AGA 8760 

AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAGGGCATT GTTATTTTCC CAAAGAAAGA 8820 

CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880 

TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
60 AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 

CACATG AAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACA ATGTACA 9060 

TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120 

G A AAAGCAGG AATA ACTTTA CCGTG G AA AT AATGTTTAGC TTTTATCAG A G AAAATTGTC 9180 ) 

CTTCTAGAGC ATAG AGTCCC AAAACTCAAT TCTGGTTrrC CCCTGTTTTT TTTTTTriTT 9240 
65 TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 

CCTGTAAAAT TG AAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360 

GTACTTGTTT GTAAAGCACT TTGAGATCCTTGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420 

ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480 

GGGTCATTTT TCTTA AAGAC G AGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540 
70 TG AAG ATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600 

GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 

GAAAACTTTG GGGGAGGCAT GAGGGGGGAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 • 

TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGG A ATGATTTCAT GTTGCTAAGA 9780 . 

AAAAGA ATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840 
75 TCTTACA ACA CATG AAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900 

TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960 

ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A 
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■ Seq ID NO: 68 Protein sequence: 
Protein Accession #: NPj05483 1 

5 1 n 21 31 41 51 

I 1 I I I ) 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60 
QSDAAELNHK EEHSLHVQDP SSSSKKDUCS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120 
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 1 80 

1 0 QSGQANCQGL SPVSVASKNP QVPSDGG VRL NKSKTDLLVN DNPDP APLSP ELQDFKCNIC 240 
GYGYYGNDPT DLDCHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300 
SG VLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFROCF CNFTYMGNSS 360 
TELEQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420 
DTPVGYSVPl KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYOK QHGAVQSGGL 480 

15 NPELNDKLSRGSVINQNDLAKSSEGETMTKTDKSSSGAKKKDFSSKGAEDNMVTSYNCQF 540 
CDFRYSKSHG PDVIVVGPLL RHYQQLHNIH KCTIICHCPFC PRGLCSPEKH LGEITYPFAC 600 
. RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKJEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTA ADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSESWKR 780 

20 EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKQFLQGAP AGGEKSG ALP QQYPASGENK SKDESQSLLR 900 
RRRQSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNUK QNNGEQIIRR 960 
RTRKRLNPEA LQAEQLNKQQ RGSNEEQ VNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1 020 
GSLTKSHSAQ QPVLVSQTLD IHKRMQPLHI QIKSPQESTG DPGNSSSVSE GKGSSERGSP 1080 

25 lEKYMRPAKH PNYSPPGSPI EKYQYPLFGL PF VHNDFQSE ADWLRFWSKY KLSVPGNPHY 1 1 40 
LSHVPGLPNP CQNYVPYPTF >a.PPHFSAVG SDNDIPLDLA IKHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NWKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260 
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

30 Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession #: XM_073879 

Coding sequence: l-387(underiined sequences conespond to start and stop codons) 

I 11 21 31 41 51 

35 I 1 1 I 1 t 

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60 
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTG ACAA TGG A AACAAA CCAGTCCCTG 1 20 
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 180 
CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 

40 GACAGGAAGG CTCACTTCTC TGCCCCAGIT GCAGCCATCG CCTCTCGAGC ACCGACTCCT 300 
GTCTGTCCTG CACACCACTC AACACAG AGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAA CTAA 



Seq ID NO: 70 Protein seouence: 
45 Protein Accession XP_073879 . 

1 11 21 31 41 51 

i t I I 1 1 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG LLTMETNQSL AQGTGCS WK VDTVLFESLY 60 
50 HCGFEHGSVM HCLODDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 7 1 DNA sequence 
Nucleic Acid Accession #: AB033064 
5 5. Coding sequence: 826- 1 98 6 (underlined sequences correspond to start and stop codons) 

1 U 21 31 41 51 

1.1 11 I 1 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 
60 CCTGCCTTGCCCCCCCGCAGGGGGTAAGGAACTGAGCGTTTAATCTTTAGCCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCC ATGT 1 80 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGG A 240 
GAGTGACAAT AAATTAGCCC CI IC ITU 11 CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300 
^ AG AGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 

65 GAAACACACT AAAGTGOACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 
TTTGTTGTGG CACATGG AGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 48; 
ATTTGTATCC ACCTAGG ATT TGCTGCTGGG GTAAGTCACT AGATTTATIT CTCAAAGCTC 54r. 
CCCrCTCTAT GAGCTGAAAG ACTGACCAAC CATG AACACT AGTAG GGGAT GGGGAAAGGG 6i>0 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGG AAGCAGG AGTAGCTTTT ATCATCTTCC 660 
70 TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA G AGAG AG AAC CAAACATAAA GTAAGGAGGG TAG ACGTTAC 780 
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGMGTG TAGTTCTGTG 840 
GCTGCCAAGT TGTGGTTTTT GACAG ATCGT CGCATCAGGG AAGACTATCC TCAAAAAG AG 900 
ATTTTACG AG CATTGAAGGC CAAATGTTGT G AGGAGGAAC TGGACTTTAG GGCTGTGGTG 960 
75 ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020 
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTCACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1 140 

379 



wo 02/059377 



CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200 
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGCCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGCTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACX3CGGGG TCACAGAGGT 1320 
AAAGCTOTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAA AGAGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680 
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTTCTACCCT CTGGCCGGCT C ACCCGGCGT ATGTCCCTGC TCTCCXjTGGT GTCCACTGCC 1 800 
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTG1TCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980 
G ACTGAC TCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CrrCTTTTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTC ATAGT TACCCl I TIA AATTGCTAAA AAATGTGTAT GCTCATAGGC CATGAGG AAC 2280 
AAATACTTTT 1 1 1 1 ITICAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGtC ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAG CTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520 
GG ATTTCTTA TTTTTTTGTT TTTTCCCGCT TAATTTGGTG GG AGGTCAAA TTG AATATAA 2580 
CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATQAAC 2640 
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 
TTTA A AATAT GTG CATT CAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 
TCTGTTA ATG TTGCmTTT TTnTTTnT TAATACATGC TAGTCTAACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAGAAAATG CTACCTTTTT TGTTAACAAG 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000 
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC ATTTTTCACA 3060 
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120 
T TGTTT CCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 31 80 
GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240 
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTTTTTTTCCTTTTGTTTCTAGCCTGTTCAGTGTACAGTTTATTCAAGGCTAC 3360 . 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480 
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540 
AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600 
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACG AATC CTCTG AAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTGATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900 
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960 
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020 
TA ATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200 
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260 
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380 
ACXTAAATTAA CCAACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440 
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500 
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTG AATT CCTTCAAGAT TGAGGTAG AG 4560 
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGG AAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTATATT 4740 . 
GTTAAAATTG TAATTCTA AA TTGTATTTCA AAAATGATTA TTTCTG ATAT TGTTTTTATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TA AAA i i fTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT G AAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980 
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAGCAATG 5100 
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160 
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTG A GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATnTATTT AAG ATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG 

Seq ID NO: 72 Protein sequence: 
Protein Accession BAA86S52 



1 11 21 31 41 51 
111)11 
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MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60 
RINGELITAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120 
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGHRGKAVFL ARDKHHLADL 180 
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240 
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300 
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360 
ELLTKLPGGL FNMNQLLANE IKLLVD 

Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession #: XM_040080-2 

Coding sequence: 159-1 104 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

I r I i I I 

CTG AGTGGGG GCGGGGACTG CTGG AGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGG A 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCXTTGCGAC CTAGCCAGGC 120 
GTG AGGG AGT GACAGCAGCG CATTCGCGGG ACGAGAGCG A TG AGTGAGAA CGCCGCACCA 1 80 
GGTCTG ATCT CAG AGCTG AA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240 
GGCTCCCTGC AGGGCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCX: 300 
TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GG ATTTCG GA 360 
GGTCATGGGC TCTCGTCCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA GACTTTTGTG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGG AATC GATTCTCCAT TCTGGGCCAC 480 
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540 
CTTATCTTGG TGGACACGCC GCTCTTTCTC CTGG AATC AG ATG AAATGG A GAACTTGCTG 600 
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTGAAGCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840 
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 
GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960 
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020 
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT G TAGC TCTGG GCCTGG AACT ATGAAG ACCT AGTGCTCCCA 1 140 
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGG AC 1200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260 
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC a 

Seq ID NO: 74 Protein sequence: 
Protein Accession #: XP JMOOSO. 1 

1 11 21 31 41 51 

MSENAAPGLl SELKLAVPWG HIAAKAWGSL QGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRWAAUCWN RFSILGHSFG G WGGMFFCT 120 
FPEM VDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV E ASQEPSHVF SLKQLLQRLL 1 80 
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDRSRELCA HSIRKLQAHV 240 
LLDCAVHGYF DSRQNYSEKB SLSFMIDTMK. STLKEQFQFV EVPGNHCVHM SEPQHVASII 300 
SSFLQCTHML PAQL 

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_005794 

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 
CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGAACA ACTCTGGACA 180 
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG C AACT TCATT CTTG AAGTCA 240 
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360 
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC AC TATGC TGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 
TGCTAGGCTT TCTGTGAGG A TGAGCAGCAC CGGGATAGAC AGG AAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660 
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 
GCTGCCCTAC ATGGAGA ACA GGAGGGGTGC TGTCATCCTG CTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT G AGTCTCTCT GG AAG AACTT 1 1 40 
CAAGGAACATCATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 

381 
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CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTCTGAGAGG ACTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCXDAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 

_ AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 

5 CC 

■ Scq ID NO: 76 Protein sequence: 
Protein Accession #: NP_005785 

10 1 II 21 31 41 SI 

I ] I 1 1 ) 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120 
AG VNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPW 1 80 
1 5 ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240 
QLQRIGESED CAGIVSFLCS PDASYVNGEN lAVAGYSTRL 

Seq ID NO: 77 DNA sequence 
20 Nucleic Acid Accession #: NM_002425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 

1 II 21 31 41 51 

1 1 I I 1 I 
Zd AAAGAAGGTA AGGGCAGTGA GA ATG ATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TG AGTGGGGC AGCAAAAGAG G AGG ACTCCA ACAAGG ATCT 1 20 
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGQAATG CAGAAGTTCC TTGGGTTGG A 240 
GGTGACAGGG AAGCTAG ACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 
30 TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAG AGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 
TATTCACTTT GATG ATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATG AACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTG ATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGITCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TG AGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
40 GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TG AATTTCAT TTG ATTTCTG CATTTTGGCC 1020 
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
45 CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAQGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAG GCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 
ACTTGCTTTT GAATTGCACT G AACAG AATT A AG AAATACT CATGTGCA AT AGGTG AG AG A 1 680 
ATGTATTTFC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1 740 
CTT 
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Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP 002416 



I U 21 31 41 51 

60 I 1 I 1 1 I 

MHLAFLVLLC LPVCSA YPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 
KiQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPK.WRKT HLTYRIVNYT 120 
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 1 80 
HAYPPGPGLY GDIHFDDDEK WTED ASGTNL FLVAAHELGH SLGLFHSANT EALM YPLYNS 240 
65 FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

70 Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM_000493.] 

Coding sequence: 1 -2043 (underlined sequences correspond to start and stop codons) 

1 il 21 31 41 51 
75 I 1 I I ! I 

ATGCTGCCAC AAATACCCTT TTTGCTGCTA GTATCCTTGA ACTTGGTTCA TGGAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
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CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTA AG AGG AG AGCAAGGT 1 80 
ACTCCTGGTCCACCAGGCCCTGCTGGACCTCXjAGGGCACCCAGGTCCTTCTGGACCACCA 240 
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAG AGC CAGGGTTGCC AGGACCACCG 300 
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
5 GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCXCTGGTA TGAATGGACA GAAAGGGGAA ATGGG ATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 
1 0 A AAAG AGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720 
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGGA GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840 
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
15 GCCAAAGGGG AACAAGGGCCAGCAGGTCTTCCTGGGAAGCCAGGTCTGACTGGACCCCCT 1020 
GGGAATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1 140 
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 
20 CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCT a TTGGGCCACC AGGCATTCCA 1380 
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
25 ATGCCTG AGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1 620 
AGTGCCAACCAGGGGGTAACAGGAATGCCTGTGTCTGCTTTTACTGTTATTCTCTCCAAA 1680 
GCTTACCCAO CAATAGGAAC TCCCATACCA TTTGATAA AA TrTTGTATAA CAGGC A ACAG 1 740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAG ATAC CAGGAATATA CTATTTTTCA 1 800 
TACCACGTGC ATGTGA AAGG GACTCATGTT TGGGTAGGCC TGTATAAG AA TGGCACCCCT 1 860 
30 GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920 
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980 
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TC TAAC TCTA 2100 
CCXCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATCTAATTTT TATTTTCTGA 2160 
35 . AATACAGATTTGAGCTATCAGACCAACAAACCTTCCCCCTGAAAAGTGAGCAGCAACGTA 2220 
AAAACGTATG TG AAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAG A AATAG AGCTC TAAGTTATGT 2400 
G AAATTTGAT TTGAGAAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 
40 TATGAG AACT TCTAGGAAAC ATCCAGG AGG TATCATATAA CTTTGTAG A A CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TG AGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
G1TAAAGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TCATC TGAA A TGCAAGGTGC 2700 
TTTCATCAAT G AACCTTTTC AAAACTTTTC TATGATTGCA G AGAAGCTTT TTATATACCC 2760 
45 AGCATAACTT GGAAACAGGT ATCTG ACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATnTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATT TTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 
5 0 TCCTTATTTA CG ACAC AATA A AATAACATC AATAG ATTTT TAGGCTG AAT TAATTTGAA A 3 1 20 
GCAGCAATTT GCTGTTCTCA ACCATTCnT CAAGGCTTTT CATTCQACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

Seq ID NO: 80 Protein sequence: 
55 Protein Accession NP_000484.I 



I 11 21 31 41 51 

60 MLPQIPFLLL VSLNLVHGVF YAERYQTPTG DCGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG P AGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKG APG 1 80 
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

65 PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPAGYPGAKGERGSPGSDGKPGYPGKPGLDGPKGNPGLPGPKGDPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGl 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

70 YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA irDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence . 
Nucleic Acid Accession #: NMJ0O0786 
75 Coding sequence: 332-1 861 (underlined sequences correspond to start and stop codons) 

I n 21 31 41 51 
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I I 1 1 I i 

CXjCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAG AGTGTT ATTTAAGGGC GTGGCCAGCG G A ACATCCCG CCCC ATTCTG TG ACGCACGG 1 80 
D GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CG ACGGAGTG A ATGG CGGCG GCGGCTGGG A TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
^ CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
1 V CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAQTC CTCCATACAT . 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAG AAAAT GCATATG AGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGC TGCAeTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAG TCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 

1 D ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900 
GAGTTGGGGA GAA AGTGG AG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1 080 
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1 140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 

ZD CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCXTrGGACT TrAATCX:TGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1 680 

J U TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAG A CAATTTGGTC 1 740 
CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAATG 1860 
^AAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
G AG A ATG A AG TGTAC AAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTA ATTCTA 1 980 

J D A AAGCCAGTT TATG ATTTAG G ATTTTGTTA ACTG AATQGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AG AATAAAAA TCCCAGTGAG ATAGTTGGAA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCGA 2340 
ACATCTAAAQ CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TT ATTAATGC 2520 
TAAAG ACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTITGGG CTCAGTTGTT CCCTGCTTCC T fi nTlTiT TTTTTTTTTT 2700 
TTG AGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCX3AT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCXTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 

D U TCACTATGTT GGCTAGGCTG GTCTTG AACT CCTG AGCCTC GTG AGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
^ CTTGGGAGTA ATAAA ATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3 1 80 
'TCTCAAATTG TAACTCGGGG CCT AACTATA TG AG ATGGCT GAAAAAATAC CACATCGTCT 3240 
O-mTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAOAA TCTCTGCTAA 1 i l l IC AGAT 3360 
TAAAATTCTC TTTG AAAAAA T 

60 Scq ID NO: 82 Protein seauencet 

Protein Accession #: NP_000777 

1 11 21 31 41 51 
<^ I I I I I I 

OD MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA lAFGKSPIEF LENAYEKYGP VPSFTMYGiCT FTYLLGSDAA 120 
ALLFNSKNED LN AED VYSRL TTPVFGKG VA YDVPNPVFLE QKKMLKSGLN lAHFKQHVSI 1 80 
lEKETKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 

/ U PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCDCETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQIKTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

75 Seq ID NO; 83 DNA sequence 

Nucleic Acid Accession #: NM_00655 1 .2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

) 1 I I I I 

A ATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
5 AC CATGA AGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT G ATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AG AGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360 
1 0 TCCACTGTCT TTCAATGACA CCCTG ATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Protein sequence: 
Protein Accession NP 006542.1 

15 " 

1 n 21 31 41 SI 

\ \ \ \ \ \ 

MKXSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVKILKKCSV 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: NM_0023 1 7, 1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 
25 1 U 21 31 41 51 

GGGCCAGGAC TGAGAAAGGG GAAAGGGA AG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGA ATCCCAC CCTTGGCATT GCTTGGTGG A G ACTGAG ATA CCCGTGCTCC 1 20 
GCTCGCCTCC TTGGTTG AAG ATTTCTCCTT CCCTCACGTG ATTTG AGCCC CGTTTTT ATT 1 80 
30 TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATG CGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAQ CCCXICGCGCG AGCCGCCGGC GGCTCCGGGC GCXTTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
35 CCGAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCG AAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACG AC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
40 ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTACTTCCAGTACGGTCTCCCAG ACCTGGTGGCCGACCCCTACTACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
45 ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATGAG TTTAGCCACT 1 140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTGATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
50 ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TIAgAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GG ATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
5 5 TTTTAATGTT TATT ATTTAC ATCACTTTGT G AATTAAC AC AGTGTTTC A A TTCTGTAATT 1 740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Piotein sequence: 
Protein Accession #: NPj002308. 1 

I 11 21 31 41 51 

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120 
ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 180 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYIUCVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 
Nucleic Acid Accession #: NM J0O64 19.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) ' 
75 I 11 21 31 41 51 

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
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GCTCAAGTCT G AACTCTACC TCCAG ACAG A ATG AAGTTCA TCTCG ACATC TCTGCTTCTC 1 20 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGG AGGTCTA TTACACAAGC 1 80 
TTG AGGTGTA GATGTGTCCA AGAGAGCTCA GTCTTTATCC CTAG ACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCXTCGTGG G AATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAQ 300 

D AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTG AG AAA AA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCC CTGA 420 
TGCTGATATT TCCACTAAG A ACACCTGCAT TCTTCCCTTA TCCCTGCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGG AG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 
1 K) CAGCACCCTA TATACACTTG G AGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
A AAATAGTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTrGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGGA AT G AGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GOG AGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 

^ T' AGATA TCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
G ACAAACTTC TTTCACTCAC AT CUl IT CA CTGACTTTTT TTGTGGGGGC GGGGCX:GGGG 1 140 
GGACTCTGGTATCTAATTCTTTAATGATTCCTATAAATCTAATGAeATTCAATAAAGTTG 1200 
ZU AGCAAACATT TTACTT 

Seq ID NO: 88 Protein sequence: 
Protein Accession NP_0064 10. 1 

25 I n 21 31 41 51 

1 I I I I I 

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKEIIVWiCK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

3 0 Scq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM_002652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 

1 • II 21 31 41-51 

35 I 1 I I j I 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAG CATGC GCTTGCTCCA GCTCXTTOTTC 60 
AGGGCCAGCC CTGCC ACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAAC AA AGCT 1 20 
CAGGACAACA CTCGG AAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 1 80 
AATG ACG AAG TCACTGCAGT GCTTGCAGTT CAAAC AGAAT TGAAAG AATG CATGGTGGTT 240 

H-U AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTG AAA TCCTAAAGGT AGA ATAAT GG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTCTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 



50 



55 



60 



Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP_002643. 1 

1 II 21 31 41 51 

1 i I I I I 

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIDCNF DIPKSVRPND EVTAVLAVQT 60 
ELKECM VVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAWDVIREL 120 
GICPDDAAVI PIKNNRFYTI EIUCV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: AK000341 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 



G ATAGCGCCG GGCAG AGGG A CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGGGTC 60 
G CCGCGCC AC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTTTGG ACA ATATGTT TGGACCGCGA G ATTCTCGAG TCAG AGGGTG GTTCACGTTG 1 80 

O J GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 

AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 

/ U TTCGTTtTGC GGAAAAAA AC G AGTCAG ATT ACl 1 1 ICl I C ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGGATACCTT GTGGACAAAG TTTCTTTGGA 600 
CCAACACTG A ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TrCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGQTGCAG 720 
TTCGTGCrrCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTGG CTTCCCCTTC 780 

75 GGrrGTCTCATCITCCAGTC ATCTTATATGCTAACGTTAGTCATCCTCTTCTTAAATTTT 840 

TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTG ATGAAC 960 
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AAGAAAGCAC AA TAA AAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020 
rrGTTTTAAA GCAAAGACTG AATTG AAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTGTT 1 140 
AACTGAATGC T7TGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
5 CAGCITCTCT CTCCCTCCCT CCCACAG ACT GAACCTTTCG CCAG AAGCTG TCCTTATAAC 1 260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 

1 0 CTAGCAAAAC TTCTCTTTCT GTA AGCCAG A GTCTTGTCTA TCAG ATTCCC ACAACCACTC 1 560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCXTTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTC AGCCTAG 1 740 
GTTATACAG A TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1 800 

1 5 TTCTGAGC AA GCACTGGAGT CC ACTCCG AC CTTTTTCTTT G AACATGC AT G CTGCTGG AA 1 860 

TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTQA GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 

20 CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TG AG AAGC AA GCACAA AAGT 21 60 
AGACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACTTGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

25 Seq ID NO: 92 Prptein sequence: 

Protein Accession #: B AA9 1 096. 1 

1 11 21 31 41 51 

30 MEHUCAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSrW LGbnCYMK^ 60 
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120 
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 
ILMYSYYGLSVFPSMHKYLWWKKYLTQAQLVQFVLTITHTMSAWKPCGFPFGCLIFQSS 240 
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 

35 

Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_000044 

Coding sequence: 1 1 1 5-3874 (underlined sequences correspond to start and stop codons) 

40 1 11 21 31 41 51 

I I I I 1 I 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180 

45 CGGCGGCTtCGAAGCCXjCCGCCCGGAGCTGCCCTTTCCTCTTCGGTGAAGTTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 

50 TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTGTTTTCCCCCACTCTCTCTCCACCTCCTCCTGCCTTCCCCACCCCGAGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCGAAAT AAAAGAAAAA GATAATAACT CAGTTCTTAT TTGCACCTAC 780 

5 5 TTCAGTGG AC ACTG A ATTTG G AAGGTGG AG G ATTTTGnT TTTTCTTTTA AG ATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
_ _^ ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 

60 GGG AAGTAGG TGGAAG ATTC AGCC AAGCTC AAGGATGGAA GTGC AGTTAG GGCTGGG AAG 1 1 40 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTrGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 

^ GCAGCAGCAG CAGCAGCAGC AGCAGCAAG A GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1 380 

65 GGGTG AGG AT GGTTCTCCCC AAGCCCATCG TAG AGGCCCC ACAGGCTACC TGGTCCTGGA 1 440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGG AG GAGG ATG ACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1 620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 

70 GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA GCGGGAGAGC 1 740 
GAGGGAGGCC f CGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCG AC 1 800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCO GTGTCCATGG GCCTGGGTGT I860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCOG GGGGATTGCA TGTACGCCCC 1920 
ACnrrGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTG r GCCCCATTGG CCGAATGCAA 1980 

75 AGGTTCTCTG CTAGACG ACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
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AGCACTGG AC GAGGCAGCTG CGTACCAGAG TCGCGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCXTTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCGAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TCGCACACTC TCTTCACAGC CG AAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCXACCCC AGAAGACXTTG CCTGATCTGT GGAOATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TG ATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAA ATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3 120 
TCAGCCCATC TTTCTG AATG TCCTGG A AGC C ATTGAGCCA GGTGTAGTGT GTGCTGG ACA 3 1 80 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATG AG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCrCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGC GAGA GAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA G TGAA GCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGA ATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCJ J Ji rjlJ TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTrGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seg ID NO: 94 Protein sequence: 
Protein Accession #: NP_000035J 

1 U 21 31 41 51 
1 i 1 I t 1 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTG YLVLDE EQQPSQPQSA 1 20 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 1 80 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYUjGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETKJKLT 660 
VSHIEGYECQ PIFLNVLEAI EPG VVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840 
UACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LDCSHMVSVD FPEMMAEHS 900 
VQVPKILSGK VKPIYFHTQ 

SeqIDNO:95 DNA sequence 

Nucleic Acid Accession #: NMj002497 

Coding sequence: 135-1 472 (underlined sequences correspond to start and stop codons) 

I 11 21 31 41 51 
I t I I I I . 

GGCACGAQTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CQCGACTCTG 120 
GCG ACTGGCC GGCC ATGCCT TCCCGGGCTG AGG ACTATG A AGTGTTGTAC ACCATTGGCA 1 80 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTG ACTCTG GCCCTG AAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGG GATCTT AAACCAGCCA ATGTTrTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
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TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTAGTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
5 ATT ACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 11 40 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 
1 0 GCTTGCTAAA GG AACGG AAG TTCCTGTCTC TGGCAAGTA A TCC AG A ACTT CTTAATCTTC 1 260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTG AAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CXXSTGTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCTAGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 
1 5 AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 
20 TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATG A C TGAGT GGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TOnTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTITATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
25 TAAAAGAATTCTTCAGTTA 

Seq ID NO: 96 Protem sequence: 
Protein Accession NP_002488 

30 1 11 21 31 41 51 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP lENSMPSRAE 60 
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLREUCHPN 120 
rVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 1 80 

35 CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGICFRRI PYRYSDELNE 300 
ITTRMLNLKD YHRPSVEEIL ENPLI ADLVA DEQRR>a.ERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNIJ>SSVIKKKVHFSGESKENIMRSENSESQLTSKSKCKDLKKRLHAAQLR.\QA 480 

40 LSDIEKNYQLKSRQILGMR 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM_007050.2 

Coding sequence: 1 85*4576 (underlined sequences conespond to start and stop codons) 

45 

I 11 21 31 41 51 

CCTCCCGCCT CAGTTCGCGC CGCGCGTCGG CTTGGAACGC AGGAGCGCCG GCTCCGGGAG 60 
CCCGAGCGGA GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCG ATGCG GGGCCACCCC 120 
50 GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 1 80 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGG AG AAA CCAATGCTGG ACCAGGCAGT GCCCACAGG A TCTTTCATGA TGGTGA ACAG 420 
5 5 CTCTGGGAGA GCCTCTGGCC AG AAGGCCC A CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCGACTTCC ATTACTACTT CTCCAGCCGT G ACAGGTCCA GCCCAGGGGC 540 
CTTG AACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTG AGGGCT GGGTGAAGGC AG AGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTOATATTTG AATCCGTCTC ATTGAAGGGT CATCCT GGCT ACATCGCCGT 720 
60 GG ACG AGGTC CGGGTCCTTG CTCATCCATG CAG AAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT G ACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACAC ; • a CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGC \CACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTGTGAT CCGCTCTG AT GGTGGGT " TGTGTCCAA 1020 
65 CTACGCGGAG CTGATCGTGA AAGAQCCTCC CACGCCCATT GCTCCCC 5CTGCTGGC 1080 

TGTGGGGGCC ACATACCTGT GG ATCAAGCC AAATGCC AAC TCC ATCA ATGGCCC 1 1 40 

CATCATCCTGAAGGAAGTGG AATATCGCACCACCACAGGC ACGTGGG , . .GACCCACAT 1200 
AGTCGACTCTCCCAACTATA AGCTGTGGCATCTGGACCCCGATGTTGAC / - GAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC C'rCTCACCAC 1320 
70 CAGG ACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTACG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCGTG CGCCCCTTCA TGACCATCCG 1 560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 
75 GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
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GGGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACX:CAGG 1860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC CCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
D TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 2 1 60 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGTGACAATAAGACATACAATGGCTACTGGAACCCTCCTCTCTCTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CA AAGCCAAT GGAG AG ACCA AAATCAACTG 2340 
1 U TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCCTATTA 2580 
^ CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 

1 5 GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
T GAAG GCTTC TCTTCTAGTT CTCAGG ACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CG ATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
G ATGAGCTAC CCCCGGGACC AGTTCCA ACT CGCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
Orv CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 
ZKJ AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCG ACT CAAGGTCCG A TGCAGGAGAC TGTAAAGG AC TTTTGGAGAA TGATCTGGCA 3180 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG G AAGTGGGCA GGGTGAAATG 3240 
ZD TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 
JU TGACACCATG CTTGACATGG CCGAGAATGA AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA CCCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 
J 5 GCTCCTGCCC CGGA ACCATG ATAAG AATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGCAATTACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAGACTTC TGGAGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 
4U GCCCATCCAG GTGGAGTTCG TCTCCGCAGA CATCGACGAG GACATCATCC ACAGAATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 43S0 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 
HD GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTIAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC. 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAGAAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 
DU GGGGTCTCCT GTTGCCGTGG GAGATGCTGC TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAGAA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTCCTTCTCC TTAGGAGTGG AGGAGATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 
CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC GTCCAGCATT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTG ATAAA GGTGTTCTTT GCrrnTTGT 5280 
- GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCrrraTAA 5340 
OU AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCCTGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAG AGAC AAACAAGATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAG AAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TATGTCAAA A 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCT CCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 
AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAGAGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGA ACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 . 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT G ATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GG ATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 
CCTTAGAACCCCCAGGAGAG AGCTACTGAC TTGTTATCAT CTCCATTTGATCATCTCCTC 6000 
CAATGAGGAA ACCCACGCAC CTTCC TTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAAT GGAAATTTAG 61 80 
TACTTTCTTC CTG ATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAGATAT ATTTTAGTAA 6240 
/D CACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTAATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGGACCA GCCAGGGAAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 

390 



55 



65 
70 



BNSDOCin.<WQ,^_a20S9377Aajl> 



wo 02/059377 



PCT/US02/02242 



TTCAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTCAAGAGCCCACTGTATGCTGGGGAAATGGAACCATGGTGCAGTAGTGTGQA 6540 
GTGGATGAGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGGAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGGAAAA AAAGGAAACT 6660 
5 CAGGCAAAGG G ATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGGAGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAGACAG AT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGGA GCTTTAGAGA GATGCACTCA GCTTCGTGTT GGCCAAGACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
1 0 CmTTAATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 

GCAATGGATC ATGAGACTCA G ATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGGAG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCCATTGGC^TGGGCTCTGGAGCTGTCCAGAAGTCCAGGGACACCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 
1 5 AAGGAACCCC ACGATGAACT TGCTCTTTTC CTTTGATG AG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA G ACAGGATG A GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATCCTAAGATTTTTTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
20 AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGG ACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TGGGGATGAC 7800 
TTCCCTAAGA TCCCATGGGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
25 CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAG ATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTGT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 81 60' 
30 GGGTGAAGAA TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAGTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATTCTAAAGGAGGGTTTGTGATTGTCACCATTTACTGATGAGG AAACTAAGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AACCAGGATT 8460 
3 5 TAAACCGGTT TTTTCTCACT ACAG AG AC A A TATTTTTCCA CC ATTGTATC TCACATTTTT 8520 

CCCAGGAGGT TACCCATAAC AGAAGAG ACT AG AGTGGA AC AG ATACGTCA GTGGATAAAG 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CTCATGTTTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TrGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGrT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 
40 TATGACGGAC CCTAGGTCAG CTGGTCTTCA GCTTG ACCCT AGAATTG act CTAGGAGCAG 8820 
TGACCCTGCT GCCTCCCAGA GCCAGTTATA GGCTCAAGAT CAAGACCAAC TG ACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA G AAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
45 TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTGTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGG AGGGGGC ACTATTAGGA AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGTGGTGGACAGAGCTCAGTCTTTAGTTTGAACGAAAACATAC 9360 
50 ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGG AAAAC AGGAACGCAA GGCCAGGAAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGGAG ATG a CTTATTTAGC AAAATGGTCA CTTCTGTGAC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAAAAGATG ATGCATTTCC CTCTGACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
5 5 CCACACTAGC TCTTGCTGTA AGTCCTCACC AGG ATCTACA TTTCCTCGTC GCTGGTGGG a 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT GCAGGAGTGA CTGCCCAACC 9840 
CAGATCCAGA GACATTGAAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG CCACCCTTCC TTAGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
60 CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGTTGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AGAAAAAAAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTTCTTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
65 AAATCCAACT AGAAG AGACA AGCTGAGATT CAGATITGTT TACTCCTCCC ATGCAA AGTT 1 0320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
70 ACTCTTCTAG TmTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCrrCTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
75 CTATTGGG AA G AGTTTTAGG GGCTTGATCA TTATGGGGCT TCCTTCTATA TCTGAG AACT 1 0920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCrr GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 1 1040 
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TTAATGTAGT TCATGGTTCC TTAGTO AGCA GGACTCTTOG ATGTGGAGGA GAAAGGGTCA 1 1 100 
TAGG AAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTXjCCTTT 1 1 160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 1 1220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 1 1280 
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 1 1340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 1 1400 
TGAAGTCAAG CACAG1TTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 1 1460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCCCA 1 1520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 11580 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 1 1640 
TTGAACTCAT TCATACCTGT AGAGCACGAA TCCAAGTAGA TTGTGTTTGG TCTGTACAGG 1 1700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCC ACAT ATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCATGTA GATCTCTCAT G AGAGCAGCC ATAACTCAGA 1 1 880 
CCCACTGACC AACA GGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 1 1940 
AG AGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GATCCTGAGC TAGCCCATGT 12120 
AAATCCGGGT TAACCCATAT TGGTAATCAT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCXACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCA ACTTT TTAAAAAAGC CAGCACTGTG CTGTCAATGT TTTTTTTTTC 12480 
1TT TCAATT C TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Scq ID NO: 98 Protein sequence: 
Protein Accession NP_008981.1 

I 11 ' 21 31 41 51 
1 i I 1 I 1 

GIREFNPPRO TEINTYROSI NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180 
VKVNGGPQGN PVWNVSG WT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV hA/^GQNATFQC lAGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRFSATVS VADTAQRSVS KYRC VIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360 
YLWDCPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV ElVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQY VFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EEL WQTEED 540 
VPGAVPLESI QGGPFEEICIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 
iCLRNETHHLF VGLYPGTTYS FTIKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTITVMUCP AQSRGAPVSV YQLWKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEFCQVDNT VKMAGVIAGL LMFHILLGV 840 
MLTDCRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHITQM 960 
KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1 140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEG WD IFNCVRELRA 1200 
QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAFVVTQH PLPNTV ADFW RLVFDYNCSS WMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED HHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKWRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFHIVK TLRNNKSNMV 1500 
ETLEQYKF W EVALEYLSSF 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM 002988.1 

Coding sequence: 71-340 (undeilined sequences correspond to start and stop codons) 

I n 21 31 41 51 

I i I i I I 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATG AAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 1 20 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAG ATTCCA CAAAAGTTCA TAGTrGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 
CCACCCTGG A GGCCACCTCT TCTAAGAGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTAATCTT AGTTTATGCA TCATATTTCA TTTTGAAATT GATTTCTATT GTTGAGCTGC 540 
ATTATGAAAT TAGTATTTTC TCTG ACATCT CATGACATTG TCTTTATCAT CCTTTCCCCT 600 
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TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAG ACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
^ AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession NPJ002979.1 

^ I 11 21 31. 41 51 
10 I I I I 1 I 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 

Seq ID NO: 101 DN A sequence 
15 Nucleic Acid Accession #: NMjO 1 5507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 



1 n 21 31 41 51 

o/^ • ' « • ^ « 

20 CCGCAGAGGA GCCTCGGCCA GGCTAGCGAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAG ACTGCA GGGACAGCAC CCGGTAACTG 120 
CG AGTGGAGC GGAGGACCCG AGCGGCTG AG G AGAGAGGAG GCGGCGGCTT AGCTGCTACG 1 80 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

^ ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

25 GGGAACGCGG CCAGTGCAAG QCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTGAAGCTA CATG.CGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

30 CTCAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGA ACTCTAGGACATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCt GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

35 AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

40 ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTG AATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
. . GACTGCAGCT TCAATCATGG G ATCTGTGAC TGGAAACAGG ATAGAGAAGA TG ATTTTG AC 1 500 

45 TGG A ATCCTG CTGATCG AGA TAATOCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1 560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGG AA ACTTCG AGTG 1 680 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACG AGTGA GGATG AAAAG 1 740 
_ TGGA AG ACAG GG AAA ATTCA GTTGTATCAA GO AACTGATG CTACCAAAAG CATCATIll i 1 800 

50 G AAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGG ATGGCGT CTTGCTTGTT 1 860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGAC T GAA TGTTACT ATCTTTATAT 1920 
TTGACnTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATA A G ATATGCCAA TATTTGCTTT AA AT ATC ATA TCACTGTATC TTCTCAGTCA 2 1 00 

5 5 TTTCTG AATC TTTCCACATT ATATTATAAA ATATGG AAAT GTCAGTTT AT CTCCCCTCCT 2 1 60 
CAGTATATCT G ATTTGTATA AGTAAGTTGA TG AGCTTCTC TCTACAACAT TTCTAGAA AA 2220 
TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAG ACT TTTCCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

60 

Seq ID NO: 102 Protein sequence: 
Protein Accession.*: NPj056322.2 



1 11 21 31 41 51 

65 I I 1 I I I 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGOC FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
_ _ GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
70 GSFKCKCKQG YKGNGL^CSA IPENSVKEVL RAPGTIKDRI KKLIJVHKNSM KKKAKC^^ 300 
PEPTRTPTPK VNLQPFNYEE iVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSUF EAERGKGKTG eAVDGVLLV 540 
75 SGLCPDSLLSVDD 

Seq ID NO: 103 DNA sequence 
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Nucleic Acid Accession U: NM_001 S6S. 1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 
^ 1 II 21 31 41 51 

5 I I [ 1 I i 

GAG ACATTCC TCAATTGCTT AGACATATTC TG AGCCTACA GCAGAGGAAC CTCCAGTCTC 60 
AGCAC CATG A ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 
ATTCAAGGAG TACCTCTCTC TAG AACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 1 80 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 
1 U CGTGTTGAG A TCATTGCTAC AATG AAA AAG AAGGGTGAG A AGAGATGTCT GAATCCAG AA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGG AGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTG ACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 
1 5 GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GC TCTA CTGA GGTGCTATGT TCTTAGTGG A TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 
ACCmrCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC nTGGGGTTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
20 CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAG ACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTC AGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 
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Seq ID NO: 104 Protein sequence: . 
Protein Accession #: NP 00 1 556. 1 



I 11 21 31 41 51 

.30 1 I I I I 1 

MNQTAILICC LIFLTLSGIQ G VPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 
EIIATMKXKG EICRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 1 05 DNA sequence 
35 Nucleic Acid Accession NM_0IS068.I 

Coding sequence: 1 ] 70-2243 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I 1 I I I i 

40 GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 

AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
AGOG AACGAA GAAGGGACG A GCTCTCTGAA GAGATCAACA ACTTAAGAG A GAAGGTCATG 1 80 
AAGCAGTCGG AGGAGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AGAGGAGAAC 240 
ACCACCCTTC GAGAGCAAGT GGAACCCACC CCTGAGGATG AGGATGATGA CATCGAGCTC 300 

45 CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAGA AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAOGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTC(jTG 480 
ACAAGCATGA TGACCGGCCG TG CTGC CCGT TGGGCCrTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTACCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TGAAGACCCT 600 

50 CAGAGGCGAG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCAATGCTTT CCAG ATGATT GCCCAGGACC TGGATTGG AA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTGATT GGGCAGTGCA TTCACATTGA GAGAAGGCTG 840 

^ GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 

55 ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGGC GGGAAACTCC 1080 
CCGGCiCCCGC TGTAGAGGGA CCTTCAGCGA CCGGGCCAGA AATAATAAGG TCCCCACAAG 1 140 
ATGATGCCTC ATCTCCACAC TTGCAAGTG A TGC TCCAGAT TCATCTTCCG GGCAGACACA 1200 

OO CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 

- TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
05 CTATCGTCTT TG ATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1560 

CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAG AT GG ATACAGAG 1 620 
TTTACCAACC AGTGAGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA GCACACAGTA 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTGTT CG AGATTTTG 1 800 
70 TGGCAAGAAA TGTAAAAG AT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTG AA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 

- - ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
75 AAGGAAG ATC ACTATATGTA CCTGTG ATG A TCACTTGGAA TCCACACTGG TACCGCCAGC 21 60 

CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAA TACCTG TCATGTCCTT CAGGATCH'CT GCCCTCAAAA 2280 
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TTTATTCCTG TTCAGCTTCT CAATCAGTG A CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACCTGAGGCACATCCTCTCTGAA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTG AGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTA ACTAA AATAATTAAT 2520 
AAGTrrACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTA AATTA TAGGCTGCXTA AATTTGCTGT TTAACATTTA CAGAGAAGCT GATACAAACG 2640 
CAGGAAATGC TGATTTCTTT ATGGAGGGGG AGACGAGGAG GAGGAGGACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTG ATCCTCT CGCTGCACCC TTAG A AACTT 2820 
CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGGAGCAG CAACTCGACT 2880 
GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATG AAAC ATCTCAGGCC TGAAAAG AAA 2940 
GTGCTCTCTC AGATGG ACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTG ACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TTCATGAAAT GTGACCACCT CTCAATCACC TTTGAGGGCT AAAG AGTAGC ACATCAAAAG 3 1 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGTTCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAG AG GAGAAATGAT TTAGCACCCA G ATTCACACT CCTATGCCTG 3420 
GAAGGGGGAC ATCTTTGAAG AAGAGGAATT AGGQCTGTGG ACACTGTCTT GAGG ATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTG AT GGTAACCACT TCAA AAGGAT CAG AATCCAC 3540 
GTAATGAAAA AGGTCCCTCT AGAGGATGGA GCTGATGTGA AGCTGCCAAT GGATGAAAAG 3600 
CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTGAACT GTTTT ATCTA ACCTCTCTTT TTCTGTTTAA 3780 
TTCTTTGCTA CTGCCATTG A CCCTGCTGCA GG ATTTGTGT CATTTTCCTG CCTGGTTGCT 3840 
GAGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAA AGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAG AGGAG 3960 
TCTAACGGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TATCATCCGC 4020 
TCTTATTCTT GCATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT GAATTACAAA GCACTATCCA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTA ACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATG AG AG GCTCCTCAGA ACTGGAGCAT TTGTATA ATA ATTCATCCTG 4260 
TTCATCTTCA ATTTTAACAT CATATATAAT TTCAATTCTA TCAATTGGGC CTTTAAAA AT 4320 
CATATAAAAG GATATAAAAT TTG AAAAGAG AAACCTAATT GGCTATTT AA TCC AAAACAA 4380 
CTriTlTTTT TCCTTCAATG GAATCAGAAA GCTTGTCAAT CACTCATGTG TTTTAGAGTA 4440 
ATTACTTTTA AAATGGTGCA TTTGTGCTTC TGAACTATTT TGAAGAGTCA CTTCTGTTTA 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAG TTGTTTTTTG TCAAATTTAC 4560 
AGTTGTCAAT TG ATCTTCAA GCTGCAGGGT GCCTAG AAAT GGGCCGTTGT CTGTAGCCCT 4620 
GGCATGTGCA CACGGACATT TGCCACCACT GCAAGCAAAA GTCTGGAGAA GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGCTGTTT ACTAA AGCTTGTGAT TAACTTTTTG GCAGTGTGTA CTATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TG AAGTTTGG TTAAGTTTGC TTTCACTCAA TTG ATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGGA TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
TTATTCATAA GAAAGGAAAA AAATCAATQG AATTTGATAT CTAAAG AAGT TAGAAA GGGA 5040 
GCAAAATAAA AAACATAAAG GAGATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCA AACTGGC AAAATTAATT AATTG ACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5 1 60 
AGAAGGAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
TTATCATGCT AGGATATTGA AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGG ACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAdG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGG ACTT CTGAATTG AT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAG ATACAG ACAGTTCCAA GATTGACAAC AG AGAA CAAC 5520 
CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AACCATGCTA CATTTTCATC 5580 
TAAAGAGAGA ATGCACATCT GATGAGACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTG AAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCITTGTGTA ATCAAGT TTA 5 700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGG ATAAA 5940 
AACCCCAGTG GGAGAAAGTT CAAAGGTG AT TAG ATTAATA ATTTAATAGA GGATG AGTG A 6000 
CCTCTGATAA ATTACTGCTA GAATGAACTT GTCAATGATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAAAAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
CACTGAACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 61 80 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TA A AAATTCA AT AAAG ATTT 6240 
AGTGGTTAAG TGC 

Scq ID NO: 106 Protein scQuetice; 
Protein Accession »: NPJ)55883. 1 

1 11 21 31 41 51 

MTERRIOJELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDJE 60 
LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 1 20 
VTSMMTGRAA RWASAKLEKS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 1 80 
VIDYSNAFQM lAQDLDWNEP M.IDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIHIERR 240 
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LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 
J Nucleic Acid Accession #: NM__003 679.1 

Coding sequence; 47-l507(underiined sequences coirespond to start and stop codons) 

\ M 21 31 41 51 
1^ I I I I I I 

1 U GGCACG AGC A G AAGCA AC AA TA ATTGTG A A AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CATTCAAAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACAAGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAGT 180 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
. CTTGAAAGCTGTTGGCCTGG AAGATCAGATTGTATCCCAAGGTATTCCCATGAGAGCAAG 300 

^ ^ ^TGATCCAC TCTCTTTCAG GAAAA A AGTC TGCAATTCCC TATGGG ACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGGATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAA ATGCACT TTAACCACAG GCTGTTGAAA TGTAATCCAG AGGAAGGAAT 480 
GATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTG ACC TCATTGTAGG 540 
^ ATGTGATGGA GCCTATTCAA CTGTCAG ATC TCACCTG ATG AAGAAACCTC GCTTTG ATTA 600 

CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AG AACGGAGA 660 
1TATGCCATG G AACCTA ATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATG AACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AG ATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGGAGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTGAGAATCCCAGATGATCACGCGATTTCAGACCTATCCATGTACAATTACAT 1140 
AGAGATGCG A GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAGAACA TGGAGAGATT 1200 
J U TCTTCATGCG ATTATGCC AT CG ACCTTTAT CCCTCTCTAT ACAATGGTCA CmTTCCAG 1260 

AATAAGATAC CATG AGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC T TGGGA TCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACA ACATGT TTCCCCGCAA AGGCCGTGGA CTCCCTAGA A CAAATTTCCA ATCTCATTAG 1 500 
CAGGIGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATGAAAAAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GGAAGATAGT GTTCTGCTTA 1620 
"^^^TyAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTGCAATGAC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAGATGCC 1800 
TCAGGG AAG A CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACmTCAT 1920 
TACGTGAATGGAACTTACCTAACCACAGGGCTCAGACTTACTAGATAAAACCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCTATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
^0 CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 

TCTATTTATT TATGTATTTA G AGATCAGGT CTCACTCTGT TG ACCAGGCT GGAGTGCAGT 2220 
GGTGAG ATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TnTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA '>460 
AAAAGGCTATTAATATCATA CTAAGTGAAG GACAGGAAAG GGTITTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACnTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCG ACTA 2640 
CATACTGCAG CAG AACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
D 3 TCTTTGGTTC ATCCTTCTTT A ACAGGCTGC TG AGTCACTC AG AAATCCTT CAAACATGAT 2760 

TAATTATGAA GATGAAAC AC TAGAGTCATA TAAG AAATAA AAATTGGGCA ATAAAATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCT AT AACAAAAGAC AGATTG ACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
OU TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCG AC A AAG AACAGT AAATnTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GGAGTTTGCC TTGTAGATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
^ TCTCTGAAACTCCTCTTTGTCATACTGCACATATAAAACGTC 3300 

OD GATTTCTITT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTGA ATAGAG TCCAACAGTA CAAAAA AAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATFTAAAAAA TATATTTGAA ATGAAAATCT CCAACACATT AGAAGATGAT 3540 
7n ?^^?TT'I^S'!^T ^SitlS?!^ TGCCACAAGT GGTmTTCA TTATGTAAAG CACCCGTTGA 3600 
/ U ATTAAAAG AA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC CAAG AGCATA TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
^gTT^?:^ TGAAAGTCCA A AAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGGAT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
7^ ?^^A^JnTJf^A GGTGATGATA AAAATGATGG AAGAAGAGTT 3900 

/ D GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCACTTCCAGCTGCTGGTAGGTCTTTAGCAGGCCTCTGGCACCTCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA G AGAAAAAAT AAAGTGGAAT 4080 

396 



40 



50 



BNSDCXaO::<WO ^0a0SS377A2J_> 



wo 02/059377 



PCT/US02/02242 



ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140 
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACCTAA ATC ACCCCAG AGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGG AAT 4260 
ATGTAGG ACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
5 G AGGGCAGGT ACAACTATTA AG AG ATTTTG AACATTAAGT TAGTCC ACAA ATATTCAGTG 4380 
GGCATCTACT AGGTGACAGC CACTGTGCTA TAATTAGAG A CTTTTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG AAATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 

1 0 CATCG AATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 

GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGGAAC CTAGATTCCT 4740 
AAATGACtGC ATAGGACAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTG AA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATGAC CAGTAAA ACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 

1 5 ATTATTTAG A GCGAAATAAA TGTTGAATAT ATGGACTTTC TCAG ATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 

Seq ID NO: 108 Protein sequence: 
Protein Accession »: NP_003670. 1 

20 

I 11 2i 31 41 51 

MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSIhOALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 

25 AAEKYPNVKM HFNHRLLKCN PEEGMITVLG SDKVPKDVTC DUVGCDG AY STVRSHLMKK 1 80 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMMIALPNMN KSFTCTLFMP 240 
FEEFEKI.LTS NDVVDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 
LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRLRI PDDHAISDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420 

30 VINKGLFFLGSLUISSTYLLIHYMSPRSFLCIJRRPWKSVIAHFRNTTCFPAKAW 480 
SISTLISR 

Seq ID NO: 109 DNA sequence 
Nucleic Acid Accession #: NM_0061 15.1 

3 5 Coding sequence: 236-1 765 (underlined sequences correspond to start and stop codons) 

1 It 21 31 41 51 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
40 CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATCGA 240 
ACGAAGGCGT TTGTGGGGTT CCATTGAGAG CCGATACATC AGCATG AGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CCCTGGCCAT 360 
45 TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 420 
CGGGAGACAC AGCCAGACCC TGAAGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 480 
TCTGGG AGTG CTGATG AAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGG ACTTG AT GTGCTCCTTG CCCAGGAGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGG ACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
50 TCIGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAGA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAG ACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TG AAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAG ATG ATCCTG AAAA TGGTGCAGCT GG ACTCTATT G AAGATTTGG AAGTG ACTTG 960 
5 5 TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 
60 GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGG AO AGA GCCTCTGCCA CCCTCCAGGA 1380 
CCTGGTCTTT GATGAGTGTG GGATCACGGA TG ATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CA ACCTTAAG CTTCTACGGG AATTCCATCT CC ATATCTGC 1 500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1 560 
65 TGTCCCCCTG G AG AGTTATG AGGACATCCA TGGTACCCTC CACCTGG AG A GGCTTGCCTA 1 620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1 740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1 800 
TTGG ACACTA AAGCCAGG AT GTGCATGCAT CTTGA AGCAA CAAAGCAGCC ACAGTTTCAG 1 860 
70 ACAAATGTTC AGTGTQAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 
GTGATCTTTG GGGAG ATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTA AAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAG AGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 

75 
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1 n 21 31 41 51 

I I I I I i 

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
. D FTCRHSQTLKAMVQAWPFTCLPLGVLMKGQHLHLETFKAVLDGLDVLLAQEVRPRRWKLQ 120 
VLDLRKNSHQ DFWT VWSGNR ASLYSFPEPE AAQPMTKXRK VDGLSTEAEQ PFIP VEVLVD 1 80 
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKUCIFAMPM QDIKMILKMV QLDSIEDLEV 240 
TCTWKLPTLAKFSPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQA 300 
1 n LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
1 U LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 

SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 

15 Seq ID NO: 1 1 1 DNA sequence 

Nucleic Acid Accession #: NM_0038 1 5 

Coding sequence: 8-2452 (underlined sequences correspond (o start and stop codons) 

1 11 21 31 41 51 
20 I I I ( I j 

CGCTGCCATfi CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCG AGG G AGCCCTTGG AGCCCC AGGT CCTTCAGG AC G ATCTCCCAA TTAGCCTCAA 1 80 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGG AGC TGGACGGTGA 240 
ZD CAGTCATATC CTGGAGCTGC TACAGAATAG GG AGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCGATGGCA CTCGGGTGGT CAGTGAGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
. GCTCAGAGGCTTGGTGGTCCTGACCCCAGAGAGAAGCTATACCCTGQAGCAGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG A ATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACG CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGGA GGCGGGATGT GGTAACAGAG ACCAAGACTG TGG AGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGG ACAC ATTCTTCCGG CCCCTG AATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACGCAGC 840 

O D TGTCACCCTC G A AAACTTCC TCCACTGGCG C AGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT GCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GA ACTTCAGC AACTGCAGCC GACGGGCCCT 1 200 
GGAGAAAGCC CTCCT GGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCGAGCAGT GTGACTGTGG 1320 
CTTCCTGGAT GACTGCGTCG ATCCCTGCTG TGATTCITtG ACCTGCCAGC TG AGGCCAGG 1 380 
. TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 

GCAGTGTCGT CCTACCAG AG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1 500 
CCAGTGTCCC CCTGAtGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGQ 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTG 1740 

DU TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TGAATGGGAC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA I860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGG ACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 

OD CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CdACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTGATGCTT GGTGCCGGCT ACTGGTACCG 2160 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGGAAG CCACTGCCTG CCQACCCCCA 2340 

OU GGGCCGGTGC CCATCGGGTG ACCTGCCCGG CCCAGGGGCT GGAATCCCGC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCTCTGACCTCTCCG 2460 
G AGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGG AG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAG AC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 

^ _ ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 

OD GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 



70 



Seq ID NO: 1 12 Protein sequence: 
Protein Accession #: NP 003806.2 



I II 21 31 41 51 
I I i t 1 t 

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR DCLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRWSE GHTLENCCYQ 120 
/D GRVRGYAGSW VSICTCSGLRGLVVLTPERS YTLEQGPGDLQGPPUSRIQDLHLPGHTCA 180 
LSWRESVHTQ TPPEHPLGQR HIRRRRD WT ETKTVELVIV ADHSEAOKYR DFQHLLNRTL 240 
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300 
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AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360 
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420 
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCX^LRPGAQ CASDGPCCQN CQLRPSGWQC 480 
RPTRGDCDLPEFCPGDSSQCPPDVSLGDGEPCAGGQAVCMHGRCASYAQQCQSLWGPGAQ 540 
5 PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600 
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLWPS RPAPPPPTVS SLYL 
10 ' 
Seq ID NO: 1 13 DNA sequence 
Nucleic Acid Accession #; NMJ)024 1 6 

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons) 

15 1 n 2> 3> ^> 51 

ATCCAATACA GGAGTGACTT GG AACTCCAT TCTATCACTATG AAGAAA AG TGGTGTTCTT 60 
TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 
AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGGACTA TCCACCTACA ATCCTTGAAA 1 80 
20 G ACCTTAAAC AATTTGCTCC AAGCCCTTCC TGCGAGAAAA TTGAAATCAT TGCTACACTG 240 
AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 
AAGTGQGAGA AACAGGTCAG CCAAAAGAAA AAGCAA AAGA ATGGGAAAAA ACATCAAAAA 360 
AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAG AAG AC TACATAAGAG 420 
ACCACTTCAC CAATAAGTAT TCTGTGTTAA A AATGTTCTA TTTTAATTAT ACCGCTATCA 480 
25 TTCCAAAGGA GGATGGCATA TAATACAAAG GCTTATTAAT TTG ACTAGAA AATTTAAAAC 540 
ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATnT AAGAATCCAA ACGTTAAGAA 600 
TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 
TTAAGGCCAT G ATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 
CTCACAACAG CTGCCTGGAA G AGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAGAG 780 
30 TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 
TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 
CTACAGGCCT CACACACAAT GTGTCTGAGA GATTCATGCT GATTGTTATT GGGTATCACC 960 
ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 
TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TG1TCCCCTT TGCTTCATTC 1080 
3 5 AAGTCAGCTC TTCTCCATCC TACCACAATG CAGTGCCTTT CITCTCTCCA GTGCACCTGT 1 1 40 
CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 
AGTGC1TTCT TCTCCCAATT CATCCTCACT C AGTCCAGCT TAGTTC AAGT CCTGCCTCTT 1 260 
AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 
CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 
40 AGATTGTCAG CTCCTTGAGG GCAAG AGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 
CTAATAATAC TGTGGAACTA GQTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 
TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 
CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 1620 
G ATGCA ACAT CCTTGrCTTT TTATG ACAGG ATGTTTGCTC AGCTTCTCCA ACAATA AGAA 1 680 
45 GCACGTGGTA AAACACTTGC GGATATTCTG GACTG m I 1 AAAAA ATATA CAGTTTACCG 1 740 
AAAATCATAT AATCTTACAA TG AAAAGG AC TTTATAG ATC AGCCAGTG AC CAACCTTTTC 1 800 
CCAACCATAC A AAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 
TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 
AGTTTTATTG TCCGTTTACT TGTTTCAG AG TTTGTATTGT G ATT ATCA AT TACC AC ACC A 1 980 
50 TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 
TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 
GGAGGTTCAG TGAATTGTGT AGG AG AGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 21 60 
CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 2220 
TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 
55 AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 
GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATA A 2400 
TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA G CAAA TAATT TTTCACTTCA 2460 
AAACAGTATT G ACTTGTATA CCTTGTAA1T TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

60 

Seq ID NO: 1 14 Protein sequence: 
- Protein Accession #: NP_002407 

1 11 21 31 41 51 

MKKSGVLFLL giIllvligv qgtpwrkgr cscistnqgt ihlqslkdlk qfapspsch:-: 60 

lEIIATUCNGVQTCLNPDSADVKELKKWEKQVSQKKKQKNGKKHQKKKVUCVRKS^^^^^ 120 . 
QKKTT 

Seq ID NO: 1 15 DNA sequence 
Nucleic Acid Accession #: NMj003238- 1 

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons) 
1 U 21 31 41 . 51 

CAAGCAGGAT ACG TI Ui CT GTTGGGCATT G ACTAG ATTG TTTGCAAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT GATCTATACT TTGAGAATTG 120 
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TTG ATTTCTT TTTTTTTATT CTGACTTTTA AAA ACAACTT TTTTTTCCAC TTTTTTAAA A 1 80 
AATGCACTAC TGTGTGCTGA GCGCTTTTCT G ATCXTTGCAT CTGGTCACGG TCGCGCTCAG 240 
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG AT<XTGAGCAAGCTGAAGCTCACCAGTCCX:CCAGAAGACTATCCTGAGCC 360 
5 CGAGGAAGTCCCCCCGGAGGTGATTTCCATCTACAACAGCACCAGQGACTTGCTCCAGGA 420 

GAAGGCGAGC CGGAGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGGAGGTT TACA AAATAG ACATGCCGCC Cl ICl lCCCC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCGATTTGAC GTCTCAGCAA TGGAGAAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 

1 0 AGTGCCTGAA CAACGGATTG AGCTATATCA G ATTCTCAAG TCCA AAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT G AAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 

1 5 CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTG ATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAQG 1200 
GTACAATGCC A ACTTCTGTG CTGGAGCATG CCCGTATTTA TGGAGTTCAG ACACTCAGCA 1260 

20 CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AG CTAAA ATT CTTGGAAAAG 1440 
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACA AG A A AACATA AG A G AGCCTTGGT TCATCAGTGT TAAAAAATTT TTG AAAAGGC 1 560 

25 GGTACTAGTTCAGACACTTTGGAAGTrroTGrrCTGTTrGTTAAAACTGGCATCTGACAC 1620 
AAAAAAAGTT G AAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA QAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 1 16 Protein sequence: 
30 Protein Accession NP_003229.1 

1 11 21 31 41 51 
1)111) 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRlEAi RGQILSKLKL TSPPEDYPEP 60 
3 5 EEVPPEVISI YNSTRDLLQE KASRRAA ACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 1 20 
TFYRPYFRI V RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 1 80 
TQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240 
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
40 SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

Seq ID NO: 1 17 DNA sequence 

Nucleic Acid Accession #: NM_000095. 1 

Coding sequence: 26-2299 (underlined sequences correspond to start and stop cddons) 

1 U 21 31 41 51 

. ] I I I I I 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCG ACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGG ACAGGG CCAG AGCCCG TTGGGCTCAG ACCTGGGCCC 120 

D 0 GCAG ATGCTT CGGG AACTGC AGG AAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 1 80 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

3 J CCCCCACCCC TGCTTCCCCG G AGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC QTGGGGCTGG CTTTCGCCAA 540 
GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

OO CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTG ACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

^ CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

05 GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1 140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGG AT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

/O CGATCAAGAC CAGGATGGAG ACGGACATCA GGACtCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACOACTTTG ATGCAGACAA 1560 
GGTGGTAG AC AAG ATCG ACG TGTGTCCGG A G AACGCTG AA GTCACGCTCA CCGACTTCAG 1 620 

75 GGCCTTCCAG ACAGTCGTGC TGG ACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1 680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
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GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTCTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
5 AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCG AT TCTATGAGGG CX:CTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGOAGAACAT 2220 
CATCTGGGCC A ACCTGCGTT ACCGCTGCAA TOACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCG GGATGACAGC CACCCTCACC 2340 
1 0 GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Scq ID NO: 118 Protein sequence: 
Protein Accession #: NP 000086.1 

15 

1 M 21 31 41 51 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

20 NGSHCTDVNE CNAHPCFPRV RCINTSPGFR Cf^ACPPGYSG PTHQG VGLAF AKANKQVCTD 1 80 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRhTTD EDKWGDACDN CRSQKNDDQK 360 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

25 VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTVVLD 540 
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFl 600 
FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 
ESQVRLLWKD PRNVG WKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 

30 RLGVFCFSQE NUWANLRYR CNDTIPEDYE THQLRQA 

Seq ID NO: 1 19 DNA sequence 

Nucleic Acid Accession #: NM_0 1 42 11 

Coding sequence: 1 57-1479 (underlined sequences correspond to start and slop codons) 
I 11 21 31 41 51 

GGG ACAGGGC TGAGGATGAG GAGAACCCTG GGG ACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAGAGCC TCAACAACTA CCTGGTGATT 120 

40 CCTACTTCAGCCCCTTGGTGTGAGCAGCTTCTCAACAIGAACTACAGCCTCCACTTGGCC 180 
TTCGTGTGTC TGAGTCTCTT CACTGAG AGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGG ATAT 300 
AACAAATTTC TCAGGCCCA A TTTTGGTGGA G AACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 

45 CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGGAGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCAA TGGCACGGTC 600 
CTGTATGCCC TCAG AATCAC G ACA ACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 

50 GAGTTCACCT GGCTGAGAGG GAACGACTCT GTGCGTGGAC TGGAACACCT GCGGCTTGCT 780 
C AGTAC ACC A TAG AGCGGT A TTTC ACCTT A GTCACCAGAT CGCAGCAGG A G ACAG GAAAT 840 
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGG AATG TTCTGTATTr CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTG ACG ACCG TGTTATCA AT G ACCACACTG 1 020 

55 ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCGATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAGAT GGCAGCCAAA GATAGGGGGA CAACAAAGGA AGTA GAAGA A 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTG ACAAT G AAAACC AGC 1320 
60 GACAAGTTCA AGTTTGTCTT CCGAGAAAAG ATGGGCAGGA TT GTTG ATT A TTT CACAATT 1380 
CAAAACCCCA GTAATGTTGA TCACTATTCC AAACTACTGT TTCCTTTGAT TTTTATGCTA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAATGTTAA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620 
65 ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG G ATCTTGTAA 1 680 
TAGAAACATC AGTCCATTCC TCITTCATCT TAATCAAGGA CATTCCCATG G AGCCCAAG A 1 740 
TTACA AATGT ACTCAGGGCT GnTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1 800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC 1860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 
70 G AGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATG AAA T ATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA G ATGAGITTT TAAATAATGA ATATTATTTA 2040 
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AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460 
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG AT GCTT ATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTGA TCCTTCCAGC TGGTCTGCTC 2640 
D TGAGTGGCTT ATCCCGCATG AGCAGG AGCG TGCTGGCCCT GAGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG CCTCAGTTAT TTTAGACA AT CTCGCCATCT 2880 
TTAATTTCTT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAGAAATAA 2940 

1 U ACCAAGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AG AGCCACAG GTTCTCATTC 3000 
CmrCCCAT TATACTTCTC ACAATTCAGT TTCTATGAGT TTGATCACCT GAill ili lA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAGATGA AATGTGGTTG 3120 ■ 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 

1 D AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_055026.1 

20 1 11 21 31 41 51 
) I I I I I 

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
_ YIVESKKSFL HEVT VGNRLl RLFSNGTVLY ALRmTV AC NMDLSKYPMD TQTCKLQLES 1 80 

Zd WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLNTT RSQQETGNYT RLVLQFELRR 240 
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300 
CFDCAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360 
SFKRKISFAS lEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420 
LFPLIFMLAN VFYWAyVmYF 

Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NM J30 1 854 

Coding sequence: 163*5582 (underlined sequences correspond to start and stop codons) 

35 1 II 21 31 41 51 

I f I ■ I I I 

AACCATCAAA TTTAG AAGAA AAAGCCCTTT G ACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGACGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCAC AG TTGTG AGTGC GGGGCTCGG A G ATG GAGCCG TGGTCCTCTA 1 80 

4U GGTGGAAAAC GAAACGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAG AGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAGAGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AGAAAGAATT 360 
CTAAAGGCTC AGATACTGCT TACAGAGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCrGTITGA AGACCACACT GGAAAACCTG 600 
CCCCAGAAGA CTATCCCCTC TTCAGAACTG TTAACATCGC TG ACGGG AAG TGGCATCGGG 660 
TAGCAATCAG CGTGGAGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAGAAAACCA 720 
CG AA ACCACT TG ATAG A AGT G AG AG AGCA A TTGTTG ATAC CA ATGG AATC ACGG iTlTl G 780 
GAACAAGGATTTTGGATGAAGAAGTTTTTG AGGGGGACATTCAGCAGTTTTTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AGCATTATAG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AGATAGATGA GTATGCAGCA GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAACAGAGG 1020 
GACCCACTGT AACTGAGG AG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 
. AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1 140 

GGACAAATGA GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGG AGAGG 1200 
ATTATGATTCCCAGAGGAAAAATTCTGAGG ATACACTATATGAAAACAAAGAAATAGACG 1260 - 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 

OU CAGAAACf GA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGG AC 1 680 

03 CAACCATCTC TGCTCAGG AA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1 740 

TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAG ACC ACGTCCTGTG GGGGGGCCTG 1 800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 

/ U GTCTGCCAGG TGACAA AGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTCA ACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT CCACAAGGTC 2280 

/D CAATTGGTCCTCCTGGTGAA AAAGGACCACAAGGAAAACC AGGACTTGCTGGACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGG A AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
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CAGATGGTGT CAG AGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTCCAG 2520 
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
5 GAAG ACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAGAG A 2760 
AAGGTGCACG GGGAGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CCAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AG AGGTCCCA CTGGG AAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCXTCCTGGC CCTCCAGGTO AAAGAGGTCC TCAAGG ACCT CAGGGTCCAG 2940 
TTGGATTCX:C TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
1 0 ACCCTGGGCA ACGTGGGG AG ACTGGATTTC AAGGCAAG AC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCX3GTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGGA AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
1 5 AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 

CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 . 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAG A GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTG A AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
20 GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGGAGGTG 3660 

ATGGTG AACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 
25 CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGG AACTT GGCCCTGCAG 41 40 
GTCAAGATGG TGTTGGTGGT G ACAAGGGTG AAGATGGAG A TCCTGGTCAA CCGGGTCCTC 4200 
30 CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 
35 GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 

AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGG ACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 
40 CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG G ATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCC a GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 
45 CTTGCATTTA TCCAG ACAAA AAATCTG AGG G AGTAAG AAT TTCATC ATGG CCAAAGG AG a 5 1 60 
AACCAGGAAG TTGGTTTAGT G AATTTAAG A GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGG AAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATG ATGTG TCATCAGGA A 5340 
GTTATG ACAA AGCACTTCGC TTCCTGGGAT CAAATG ATGA GGAG ATGTCC TATGACAATA 5400 
50 ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT 1 IICIIGGCT. 5580 
AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 
5 5 TACCATTTAG G AAATACCG A TGCCTTTGTG GGGGCAG AAT CACAG AC AA A AGCTTTG AAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TG ATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAA AAA AT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 
60 CCAAAACTTG CACGTGTCCC TGAATTCCGC TG ACTCTAAT TTATGAGG AT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATG ATGAA ATTCCTTTGT GTGTGTTT 

5!«.q ID NO: 122 Protein sequence: 
65 Protein Accession NPJ00184S 



1 n 21 31 ,41 51 

70 MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRV AIS VEKKTVTMIV 1 80 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI U5EEVFEGDI QQFLITGDPK A A YDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 

75 NiVDDFQEYNYGTMESYQTEAPRHVSGTNEPNPVEEIFTEEYLTGEDYDSQRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 
AYGEICGQKGE PAVVEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPOLP 480 
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GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGFP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
D GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKU3VP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 

1 U GPACLRGFPG ERGLPG AQG A PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ J 080 
GPPGPAGEKG APGEKGPQGP AGRDGVQQPV GLPGPAGPAG SPGEDGDICGE IGEPGQKGSK 1 140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 

1 D GELGPAGQDG VGGDKLGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKFGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GliCGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQAD ADDNIL DYSDGMEEIF GSLNSLKQDl EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1 620 

20 EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

25 Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession #: NM_01 5886 

Coding sequence: 485- 1 26 1 (underlined sequences coirespond to start and stop codons) 

1 II 21 31 41 51 
30 I I I j I I 

GAATTCCCCC CCCCCCCCCC TCACITGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
C ATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTG A A TTTC A ATTGC 1 20 
TGAAAATTTC CCACTGA AAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 1 80 
• ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 

35 GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AG AAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT G ATGTCTTTG TAAATTCATA CCCTCTGGCC CTA ll IIT TT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
40 AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCXiGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
A TCGGC AGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
45 ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTrrCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCG A CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1 1 40 
50 AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1 260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGG AATTC 
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Seq ID NO: 124 Protein sequence: 
Protein Accession #: NP 056970.1 



1 11 21 31 41 51 

60 1 I I I I 1 ■ 

MIAISAVSSA IXFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAAUCAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 

^ - SNRIGCAIHA CQNMNVWGSV WRRA VYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 

65 TDNLCFPGVT SNYLYWFK 

Seq ID NO: 125 DNA sequence 
— ^ Nucleic Acid Accession #: NMj001793 

/ 0 Coding sequence: 54>2543 (underlined sequences conespond to start and stop codons) 

I 11 21 31 41 51 
I I I i I i 

GCGGAACACC GGCCCGCCGT CGCGGC AGCT GCTTCACCCC TCTCTCTGCA GC CATGG GGC 60 
75 TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 

CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 1 80 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AACjTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
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CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTG AAGG AA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCA AGGGTC 420 
CCTTCCCCCA GAGACTGAAT GAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
5 ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA G AGAATGGTG CCTCAGTGG A GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGG ACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 

1 0 ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CG ACCTCATG TTCACAATTC ACXXSGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GG ACCGGG AA AAAGTCCCTG AGTACACACT G ACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 

1 5 GCCATG AGGT GCAG AGGCTG ACGGTCACTG ATCTGG ACGC CCCCAACTCA CCAGCGTGGC 1 1 40 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACXXrr GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACXT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 

20 CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGG AC CCAG ACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCG ACC 1 560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 

25 ACCATGGCCCAGTCCCTGAGCCCCGTCAGATCACCATCTGCAACCAAAGCCCTGTGCGCC 1740 
ACGTGCTG AA CATCACGG AC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1 800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TCTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGC AA CAA AG AGC AG CTG ACGGTG A TC AGGGCCAC TGTGTGCG AC TGCCATGGCC 1 980 

30 ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 

3 5 TGTACCGTCC TAGGCCAGCC AACCCAG ATG AA ATCGGCAA CTTT ATAATT GAG AACCTG A 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCC TCCGACCAAG 2460 
ACCAAG ATTA CG ATTATCTG A ACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAG GCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 

40 CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGG AGACA GGCTATGAGT CTGACGTTAG AGTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 

45 CTGGAATGGA ACCTTCTTAG GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 

TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 

^ Q G A AGGGTG AG G ACA ATCGTG T ATATGTACT AGA ACTTTTT TATTA AAG AA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: MP J)0 1 784 

1 II 21 31 41 51 

55 I I I I I 1 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ UCSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NG ASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEG VLP GTSVMQVTAT 240 . 

60 DEDDAIYTYN G VVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVM VLA MDNGSPPTTG TGTLLLTUD 540 

65 VNDHGPVPEP RQITICNQSP VRHVLNTTDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFUE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKJLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NMJ0032S6. 1 

Coding sequence: 60-734 (underlined sequences conespond to start and stop codons) 

75 ! II 21 31 41 51 
I I I I 1 I 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
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IGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TG AG AAGGTA GTTCCGGCC A 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
5 TCAAAGGGTT TGAG AAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA GAGGGAAAGT CTGAATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CG AGTGCCTC TGG ACAGACT 600 
1 0 GGCTGTTGGA ACGA AAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA G AGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
1 5 TCTGCCCTGC CCTCAGCCTG TTG CCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTTTTCCTTTGTCTTAATCGATCTGGTAGGGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1 140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 



20 



Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP 003247.1 



i n 21 31 41 51 

25 I 1 1 1 1 1 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KiSSEKVVPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LOG VKLEANS QKQYLLTGQ V 1 20 
LSDGkVFIHL CNYIEPWEDL SLVQRESLNH'HYHLNCGCQl TTCYTVPCTI SAPNECLWTD ISO 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession ^: NM_007207.2 

Coding sequence: 143-1 591 (underlined sequences correspond to start and stop codons) 

35 i 11 21 31 41 51 

I • I I I I I 

CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 
AGGAGCGCCA GATGGTGG AG G AATACACTT ATTTATG AAA CTGTCTTG AG TTCTTCTTG A 1 20 
ATTGCCAGTT TTCAGCCTCC T CATG CCTCC GTCTCCTTTA G AGG ACAGGG TAGTAGTGGC 1 80 

40 ACTATCTAGGCCCGTCCXiACCTCAGGATCTCAACCTTTGTTTAGACTCTAGTTACCTTGG 240 
CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 
CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 
TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 
AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 

4D TGCTAACCAG ATGGTCAACA ATAATGAG AA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 
GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 
TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 
CATCAtTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG G AGCTGTCCA 720 
CATTAACTGT GCCG ATA AGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 

jU AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAG AAAT 840 
TATAGTTTAT GATGAGAATA GCAATGAAGC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 
CATAGT CCTC GAGTCCCTGA AG AGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 
TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTG ACAAC TCCCTCCAGC TCCAAGAGTG 1020 
CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTGGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

D3 CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCXTTTGCCAA 1 140 
TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 
CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 
GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGrr 1320 
CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 

OU CCGCTCCGCC ACCATCGTCATCGCTTACTTGATGAAGCAC ACTCGGATGACCATGACTGA 1440 
TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 
GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 
AAAGCTGATG GGCGTGGAGA CGGTTGTG TG A CAATGGTCT GGATGGAAAG GATTGCTGCT 1620 
CTCCATTAGG AGACAATG AG G AAGG AGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1 680 

05 TTGTAGTTGG G AGTAAAGTT TGTG AATGG a AACAAACTTG GTTAA ACACT TTATTTTTAA 1 740 
CAAGTGTAAG AAG ACTATAC TTTTG ATGCC ATTG AGATTC ACCTTCCACA AACTGGCCAA 1 800 
ATTAAGGAGG T TAAAGA AGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1860 
TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

70 Seq ID NO: 130 Protein sequence: 

Protein Accession #: NP_009 138,1 

1 U 21 31 41 51 
\ \ \ \ \ \ 
/j MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIA AG TTTTAIGTST TCPANQMVNN 1 20 
NENTGSLSPSSGVGSPVSGTPKQLASIKII YPNDLAKKMTKCSKSHLPSQGPVIIDCRPF 180 
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MEYNKSHIQG AVHINCADKI SRRRLQQGKI TVLDUSCRE GKDSFKRIFS KEIWYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGG A 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEFCGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRSATIVI 420 
5 AYLMiCHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPiCLMGVET 480 
W 

SeqlDNOilsi DN A sequence 
Nucleic Acid Accession NM_005409 J 

1 0 . Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

1 5 CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAG AGG ACGC 1 80 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 
ATAATGTACC CAAGTAACAA CTGTG ACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 
AAAGGACAAC G ATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

20 fiA AAfiAAAQA ATTTTTAAAA ATATCAAAAC ATXtGAAGTC CTGGAAAAGG GCATCTGAAA 420 
AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 
AGACTTTTCT ATGGTTTtGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 
GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 
CTGCCCAAAG G AGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

25 GGTTACCATCGGAGTTTACAAAGTGCTTTCACGTTCITACTTGTrGTATTATACATTCAT 720 
GCATTTCTAG GCTAG AGAAC CTTCTAG ATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 
CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGG AGTC AAGCCCTTAT AAGTCAAAAG 900 
CATCTATGTGTCGTAAAGCATTCCTCAAAC ATTTTTTCATGCAAATACAC ACITCTT^ 960 

30 CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1 020 
TTTATAACCA ATTCATTAAATGTAATTCATAAAATGTACTATGAAAAAAATTATACGCTA 1080 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1 140 
G ATGTTTTtC AACTTTTATT CATTG AGATG TTTTGAAGCA ATTAGG ATAT GTGTGTTTAC 1 200 
TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

35 TACAAAATGTTTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 
AATCACTTTT AClTll IGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 
TTGTTCATGC CTATATACTG TAAAATTTAG G TATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

40 Seq ID NO: 132 Protein sequence: 

Protein Accession #: NP_OO540O.l 

1 n 21 31 41 51 

45 MSVKGMAIALAVILCATVVQGFPMFKJIGRCLCIGPGVKAVKVADIEKASIMY^^ 60 
EVirrUCENK GQRCLNPKSK QARLIUCKVE RKNF 

Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #: NM_012342 
5 0 Coding sequence: 373-1 155 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT'GTGGAGACCC TACTCTCTTC 60 

55 GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCG AG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 1 80 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 

60 TGCGGGGCGT CA AIGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
G ATCCrCAG A ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 

6 5 G AATGCrrOTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
AGGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAOCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGG AGGGCT G ATTTTAGTG TTGCTTATTA TGTTGGCCCT G AGGATGCTT 900 
CGAAGTGAAA ATAAGAGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 

70 AGCTTTCACGGACACCATTCCAAAAAGGGGCAGGTTGCAAAGTTAGACTTGGAATGCATG 1020 
GTGCCGGTCA GTGGGCACGA GAACrTGCTGT CrrGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAGATCCT CTCGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TA TGA CGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGGACAGGA GCACTTTATC TGAAG ACAAA CTCATTTAAT CATCTTTGAG 1260 

75 AGACAAAATG ACCTCTGCAA ACAGAATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACAC:A CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
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TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
5 Protein Accession #: NP_036474.1 

1 11 21 31 41 51 
I I i I I I 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
1 0 SNSPLTHGCL DSLASTTDIC QAK.QARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 

SGQGmVQHDGSRNLrrKVQELTSSKELWFRAAVIAWIAGCULVUJMLALRMLRSEN 180 . 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

15 Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_001 627. 1 

Coding sequence: 64- 1815 (underlined sequences conespond to start and stop codons) 

1 II 21 31 41 51 

I I I I I I 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AG ATACCATT 1 80 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 
AAGCCCG ATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACG ACG ATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CG AA ACAG AG CAGCTAAA AA AGTTGGGTGA CTGCATTTCA 540 
GAAG ACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGG AAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GtTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATC ACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 11 40 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
GAGGAATCTCCTTATATTAATGGCAGGTATTATAGTAAAATTATCATTTCCCCTGAAGAG 1500 . 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 . 
CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATfcAC 1800 
AAAACTGAAG CCJAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGGTT TGCCTTTTAT GTAAA ITi 1 1 ' TACGTAGCTA i l l r 1 A TACA CTGTAAGCTT 2160 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTG AT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGG A GGCCGAG AGT CAGACTG ATA 2400 
GACACCATAG GAGCCGACTC TTTG ATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

65 Seq ID NO: 136 Protein seouence: 

Protein Accession #: NP^OO 1 6 1 8. 1 

1 11 21 31 41 51 

I I I I I I 

/U MESKGASSCR LLFCLUSAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSP VFI AF RSSTKKSVQ Y DO VPEYKDRL NLS ENYTLSI SN ARISDEKR FVCMLVTEDN 1 20 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLGDCISE DSYPDGNITW YRNGKVLHPL 180 
EGAWnFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVTIQVLPPIO^AIKEGDNITLKCLGNGNPPPEEFlJYLPGQPEGIRSSNrYTL 300 

75 MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 

NATWWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQIKM 420 
TFOCTDPSGLSKTIICHVEGFPKPAIQWTITGSGSVINQTEESPYINGRYYSiailSPEEN 480 . 
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VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKLIV GIWGLLLAA 540 
LVAG VVYWLY MKKSKTASKH VNKDLGNMEE hOGCLEENNHK TEA 

5 Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM_030559 

Coding sequence: 1-119 (underlined sequences correspond to start and stop codons) 
1 II 21 31 41 51 

ATGAACCGCA GCGACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC T GGAAA ATTT 120 
G AGG AGTTTT ATGG ATTACT ACAACATGTT CATAAG ATCC CCAATGTTGA CGTnTGGTA 1 80 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
15 GCTGTTTCAACGGCCAATCCACTGCTTAGGATATTTATACAAAAGAAGGAAGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGinTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
20 GTAACACCAC ATGGCTTAG A AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTITTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAAC AGACA TGATGATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG G AACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAG ATT 840 
25 G AACCAAGCT TTG AGCCAGA GGATGAAG AC AGCGAAGAAG ATGACATTAT CATTG AAG AC 900 
AATGG AGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTAIGA 



30 



Seq ID NO: 138 Protein seouence: 
Protein Accession ffi XP_030559 



1 11 21 31 41 51 

MNRs'hRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 
G Y ADIHGDLL PINNDDNYHK A VSTANPLLR IFIQKKEEAD YSAFGTDTU KKKNVLTNVL 1 20 
RPDNHRKKPH IVISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 1 80 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG lEVSGKSLDQ VTDMMIANSR 240 
40 NLIITVRPAN QRNNWRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDHIED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
iCLLEEDGTII TL 



45 



409 



wo 02/059377 PCTAJS02/02242 



It is xinderstood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
5 publications, sequences of accession numbers, and patent applications cited in this 

specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 . 1. A method of detecting a breast cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1-25. 

1 2. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 3. The method of claim 2, wherein the nucleic acids are niRNA. 

1 4. The method of claim 2, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 5. The method of claim 1, wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-25. 

1 6. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 7. The method of claim 1, wherein the patient is undergoing a therapeutic 

2 regimen to treat breast cancer. 

1 8, The method of claim 1, wherein the patient is suspected of having 

2 breast cancer. 

1 9. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -25. 

1 10. The nucleic acid molecule ofclaim 9, which is labeled. 

1 11. An expression vector comprising the nucleic acid of claim 9. 

1 12. A host cell comprising the expression vector of claim 1 1 . 
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1 13. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1-25. 

1 14. An antibody that specifically binds a polypeptide of claim 1 3 . 

1 15. The antibody of claim 14, further conjugated to an effector component. 

1 16. The antibody of claim 15, wherein the effector component is a 

2 fluorescent label. 

1 17. The antibody of claim 15, wherein the effector component is a 

2 radioisotope or a cytotoxic chemical. 

1 18. The antibody of claim 15, which is an antibody fi-agment. 

1 19. The antibody of claim 15, which is a humanized antibody 

1 20. A method of detecting a breast cancer cell in a biological sample from 

2 a patient, the method comprising contacting the biological sample with an antibody of claim 

3 14. 

1 21. The method of claim 20, wherein the antibody is further conjugated to 

2 an effector component. 

1 22. The method of claim 21 , wherein the effector component is a 

2 fluorescent label. 

1 23. A method for identifying a compoxmd that modulates a breast cancer- 

2 associated polypepitide, the method comprising the steps of: 

3 (i) contacting the compoimd with a breast cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1-25; and 

6 (ii) determining the functional effect of the compoxmd upon the polypeptide, 
1 24. A dmg screening assay comprising the steps of 
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2 (i) administering a test compound to a manraial having breast cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compoimd that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of breast cancer. 
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